DNA compounds comprising sequences encoding mannuronan C-5-epimerase

ABSTRACT

PCT No. PCT/NO93/00151 Sec. 371 Date May 9, 1995 Sec. 102(e) Date May 9, 1995 PCT Filed Oct. 8, 1993 PCT Pub. No. WO94/09124 PCT Pub. Date Apr. 28, 1994DNA compounds encompassing sequences coding for enzymes having mannuronan C-5-epimerase activity are disclosed and a process for the preparation of such enzymes. The genetic sequences and enzymes prepared may be used in the production of alginates having a definite G/M ratio and block structure. Alginates having a definite G/M ratio may also be produced by selective inactivation of the genetic sequences.

The present invention concerns DNA compounds encompassing sequences coding for enzymes having mannuronan C-5-epimerase activity, a process for the preparation of such enzymes, the use of said genetic sequences in production of alginates having a definite G/M ratio and block structure, and the production of alginates having a definite G/M ratio by inactivating said genetic sequences.

Throughout this application, reference is made to publications from the scientific and patent literature. Publications so cited are hereby incorporated in their entirety by reference.

In this application the term gene is used to indicate a genetic sequence which encodes a protein, independent of whether the protein encoded by this genetic sequence is expressed or not in the natural host organism under those conditions.

Alginates are a family of polysaccharides, which are synthesized in brown algae as well as in bacteria, such as Azotobacter vinelandii and Azotobacter chroococcum. Alginates are also synthesized by some strains of Pseudomonas sp.

Chemically, alginates are unbranched binary copolymers of 1-4 linked β-D-mannuronic acid, termed M, and its C-5 epimer α-L-guluronic acid, termed G.

Alginates derived from seaweeds and Azotobacter are generally true block copolymers where the monomers are arranged in homopolymeric stretches of M, termed M blocks, and homopolymeric stretches of G, termed G blocks, interspaced with regions containing both monomers, normally termed alternating blocks or MG blocks. The composition and sequential structure of alginates vary widely depending on the source. Alginates produced by Pseudomonas, however, do not have any G blocks.

Several functional properties such as the capacity to form gels and the binding of water depend on the M/G ratio and on the length of the various blocks. A relatively high content of G blocks, for instance gives good gelling properties, due to ionic cross linking of chains which takes place when Ca²⁺ -ions are added to an alginate solution. The composition and block structure also influence on the immunological properties of alginates. Otterlei et al, J. of Immunotherapy 10, 286-291, (1991)! have shown that alginates with a high content of mannuronic acid blocks are very potent nontoxic immunostimulants.

At present industrial production of alginates rely exclusively on algal sources. The range in composition is however limited as the highest content of guluronic acid to be found is 75% and the lowest 25%. Furthermore there are no suitable sources for alginate with a G content in the range of 42-54%. In the field of biotechnology or biomedicine, polymers with extreme compositions, such as a high G for immobilization of cells, Martinsen A., Skjåk-Br.ae butted.k G. and Smidsr.o slashed.d O., Biotechnol. Bioeng. 33, 79-86, (1989)! and a high M (90-100%) as immunostimulants Otterlei et al, J. of Immunotherapy 10, 286-291, (1991)! are of major interest.

The key enzyme responsible for generation of the G blocks is called mannuronan C-5-epimerase. It was previously thought that only one enzyme having a certain amino acid sequence would exhibit this activity. It has now surprisingly been found that there exist at least five genes encoding enzymes having this activity. Some of these enzymes differ in molecular weight and amino acid sequence. The genes were found adjacent to each other in the bacterium Azotobacter vinelandii. It has also been found that the amino acid sequence of the enzyme affects the activity of the enzyme, not only in terms of potency but also in the type of alginate formed, for example, altering the content of guluronic acid and the single/block G content of the alginate.

In Larsen, B. and Haug, A., Carbohydr. Res. 17, (1971), 287-296 and 297-308! the isolation of mannuronan C-5-epimerase from liquid cultures of Azotobacter vinelandii is reported. In the following, this epimerase will be termed mannuronan C-5-epimerase (2), and the DNA sequence encoding for it will correspondingly be denominated E2.

In Skjåk-Br.ae butted.k, G and Larsen, B, Carbohydr. Res. 103:133-136, (1982)!, the purification of mannuronan C-5-epimerase (2) by affinity chromatography on alginate sepharose is disclosed. In a separate paper Skjåk-Br.ae butted.k, G and Larsen, B., Carbohydrate Research, 139, (1985) 273-283! the characterization of this enzyme is disclosed. Further, the activity of the enzyme is described as an ability to epimerize both bacterial and algal alginate having a wide range in monomer composition and sequence of units.

From PCT/WO 86/03781 and Japanese Patent Application J63233797 it is known to produce alginic acid and/or alginate having a high content of guluronic acid by action of the enzyme (E2) on an alginic acid or alginate, whereby the G content increases.

In Chitnis, C. E. and Ohman, D. E., J. Bacteriol., 172, p2894-2900, (1990)! the gene sequences involved in the introduction of guluronic acid into exopolysaccharides from Pseudomonas aeruginosa have been reported. However, the nature of the enzyme responsible for this process has not been identified. Since this genus of bacteria is unable to produce alginate containing G blocks Skjåk-Br.ae butted.k, G., Larsen, B. and Grasdalen, H. Carbohydr. Res. 54 (1986) 169-174! it is believed that the epimerization system in alginate producing Pseudomonas is fundamentally different from the epimerase in brown algae and in Azotobacter vinelandii. It seems likely that the Pseudomonas enzyme is a monomer epimerase acting at the sugar nucleotide level, and as such is unable to introduce G-blocks into already polymerized alginates.

Production of mannuronan C-5-epimerase from Azotobacter vinelandii culture is difficult due to a very low yield. It is also a major obstacle that the enzyme is secreted together with copious amounts of highly viscous alginate which hampers the purification of the enzyme. Although alginates are secreted by some bacteria, an industrial production based on these microorganisms has not been successful. The main reasons are due to the difficulties in controlling the composition and molecular size of the exopolysaccharides. The content of guluronic acid blocks in the alginate from Azotobacter vinelandii tends to be too low for making a polymer with good gelling properties.

Alginates with a high M content having immunogenic properties as reported above, are produced by Pseudomonas aeruginosa, but this organism is unattractive from a production point of view, as it is unstable in the production of the polymer. Further, the organism is known to be a secondary pathogen in patients suffering from cystic fibrosis.

Thus, in order to produce medical grade alginates with a defined monomer composition and sequential structure there is a need for improved methods for controlling the biosynthesis of alginate, through controlling the key enzyme, the mannuronan C-5-epimerase.

The present invention is directed to cloned DNA fragments encoding mannuronan C-5-epimerase. The invention is encompassed by vectors which contain DNA fragments encoding mannuronan C-5-epimerase linked to DNA elements which direct the expression of mannuronan C-5-epimerase from the cloned DNA encoding the protein. The invention also provides for microorganisms which express the mannuronan C-5-epimerase protein from the cloned DNA as a source of the purified protein and also as a source of alginates of altered composition. Strains in which the expression level of the mannuronan C-5-epimerase gene is altered or in which one, several or all of the mannuronan C-5-epimerase genes have been inactivated are also within the scope of the present invention. The invention further encompasses methods for producing alginates either very efficiently, or having altered composition, or both, by culturing microorganisms having altered levels of expression of a mannuronan C-5-epimerase gene.

The invention further features selection of epimerase to achieve a desired level of guluronic acid, and alter the single/block G characteristics of the enzyme. In a further embodiment, the invention features the production of synthetic proteins and DNA encoding such proteins which have mannuronan C-5-epimerase activity.

BRIEF DESCRIPTION OF FIGURES

FIG. 1 shows the amino acid sequence SEQ ID NO:6 of the N-terminal end of the 122 kd protein, and the nucleotide sequence of the corresponding oligonucleotide SEQ ID NO:16. The DNA probe was synthesized as a mixture (in equal ratios) of the 64 possible combinations that could be deduced from the first seven amino acids in the sequence of the 122 kd protein. N indicates that all four bases were used at this position.

FIG. 2 is a restriction endonuclase map of the combined inserts in plasmids pHE14, pHE16, pBD1, pHE18 and PML1.

The numbers at the bottom line indicate the molecular sizes in bp. The arrow indicates the localization and orientation of the sequence homologous to the synthetic oligonucleotide used for screening the library. The five genes (open reading frames) found by sequencing are marked by boxes and denoted E4, E1, E2, E3 and E5. E1 corresponds to Epimerase I.

FIG. 3 shows Mannuronan C-5-epimerase (1) activity of a portion of the E1 encoded protein as a function of cell growth. *: OD₆₀₀ of cell culture. o: Epimerase activity given as dpm/ml of cell culture. The strain used in this experiment was DH5α(pHE5), and the extracts were incubated with the substrate for 23 hours.

FIG. 4 shows the kinetics of ³ H release. The enzyme activity was assayed by using an extract prepared from IPTG-induced JM105 cells containing pHE5 (see legend to Table 3).

FIG. 5 shows the homologies between and within the different genes. Boxes with the same letter are homologous to each other. Gaps are introduced to optimize the alignment. E1-E4 are defined as appears from FIGS. 2 and 6.

FIGS. 6A-6E (SEQ ID NO:7) shows the alignment of the DNA sequences of the A blocks from E4, E1, E2 and E3 SEQ ID NO:7=Con, SEQ ID NO:17=E4A, SEQ ID NO:18=E1A1, SEQ ID NO:19=E1A2, SEQ ID NO:20=E2A, SEQ ID NO:21=E3A1, SEQ ID NO:22=E3A2.

FIG. 7A-7B (SEQ ID NO:8) shows the alignment of the deduced amino acid sequences of the A blocks from E4, E1, E2 and E3 SEQ ID NO:8=Con, SEQ ID NO:23=E4A, SEQ ID NO:24=E1A1, SEQ ID NO:25=E1A2, SEQ ID NO:26=E2A, SEQ ID NO:27=E3A1, SEQ ID NO:28=E3A2.

FIGS. 8A-8D (SEQ ID NO:9) shows the alignment of the DNA sequences of the R blocks from E4, E1, E2 and E3 SEQ ID NO:9=Con, SEQ ID NO:29=E4R1, SEQ ID NO:30=E1R1, SEQ ID NO:31=E1R2, SEQ ID NO:32=E1R3, SEQ ID NO:33=E1R4, SEQ ID NO:34=E2R1, SEQ ID NO:35=E2R2, SEQ ID NO:36=E2R3, SEQ ID NO:37=E2R4, SEQ ID NO:38=E3R1, SEQ ID NO:39=E3R2, SEQ ID NO:40=E3R3.

FIG. 9A-9B (SEQ ID NO:10) shows the alignment of the deduced amino acid sequences from the R blocks of E4, E1, E2 and E3 SEQ ID NO:10=Con, SEQ ID NO:41=E4R1, SEQ ID NO:42=E1R1, SEQ ID NO:43=E1R2, SEQ ID NO:44=E1R3, SEQ ID NO:45=E1R4, SEQ ID NO:46=E2R1, SEQ ID NO:47=E2R2, SEQ ID NO:48=E2R3, SEQ ID NO:49=E2R4, SEQ ID NO:50=E3R1, SEQ ID NO:51=E3R2, SEQ ID NO:52=E3R3.

FIG. 10A-10C shows ¹ H-NMR spectra of alginate epimerized by extracts from A: DH5α(pHE8) (truncated epimerase 1); B: JM109(pBD9); C: no extract. The peak to the left gives the signal from G-1; the peak in the centre gives the combined signal from GM-5 and M-1 and the peak to the right gives the signal from GG-5.

FIGS. 11A-11F (SEQ ID NO:1) shows the nucleotide sequence and corresponding amino acid sequence of E2.

Now according to the present invention genetic sequences have been found which encode enzymes having mannuronan C-5-epimerase activity (MG copolymers), and thus the first aspect of the invention is pure isolated DNA comprising nucleotide sequences encoding mannuronan C-5-epimerase activity (GGhorropolymers).

The sequence of amino acids proximal to the amino terminus of purified mannuronan C-5-epimerase protein was determined G. Skjåk Br.ae butted.k et al., Carbohydr. Res. 103:133-136 (1982)!. This data was used to derive a sequence for an oligonucleotide probe which was used to screen a gene library of Azotobacter vinelandii DNA. One result of this screening experiment was the surprising discovery of a second gene and thereafter three further genes including at least one genetic block A were found. Thus, altogether there appear to be at least five different genes encoding proteins having mannuronan C-5-epimerase activity. Accordingly, it is a second object of the present invention to provide for alternative DNA sequences encoding mannuronan C-5-epimerase.

Three different blocks of genetic sequences, designated A, R and S, are found in the genes. These genetic blocks are most commonly found in combinations wherein the A appears one time or two times, the R block appears from 0 to at least 5 times and the S block appears from 0-1 time.

There is a high degree of consensus in the nucleotide sequences of each block for the different genes (1-5). Accordingly, it is a third object of the present invention to provide for DNA sequences encoding mannuronan C-5-epimerase and comprising the DNA blocks A and/or S and/or R, wherein A may appear more than once and R if present may appear singly or in repeats of up to at least 5 or 6 times.

The consecutive order of the three blocks if all three blocks are present, is preferably A, R and S. However, it has been shown that a reversed consecutive order, wherein for instance R appears before A also gives a gene encoding a mannuronan C-5-epimerase. Thus, the invention further encompasses genetic sequences having any order and any number of the blocks A, R and S.

Another aspect of the present invention concerns the use of said genetic sequences for the preparation of the mannuronan C-5-epimerase in recombinant host cells. It is especially preferred to insert the gene into hosts such as bacteria, for instance Escherichia coli or Bacillus subtilis or in yeast. The cloning and expression of the genetic sequence as described above in E. coli is described in the Examples.

The present invention also encompasses recombinant expression plasmids that can be used to produce the mannuronan C-5-epimerase proteins in a host microorganism. Such expression plasmids are made by inserting a DNA fragment encoding mannuronan C-5-epimerase into a vector which contains appropriate expression elements, such as (but not limited to) a promoter, ribosome binding site, translational initiation site and transcription terminator. The expression plasmids can be adapted for transformation into many different commonly used host organisms in which it might be desired to produce the mannuronan C-5-epimerase.

The techniques for insertion of foreign genes into commonly employed hosts are known in the art, as described for instance in METHODS IN ENZYMOLOGY, Vol. 185, Gene Expression Technology, Ed. D. V. Goeddel, Academic Press, Inc. (1990)!. Further by choice of a broad host range vector and a suitable promoter as known in the art, and described for instance in J. L. Ramos et al, FEBS Letters, Vol. 226, 2, 241-246! it will be possible to insert and express the mannuronan C-5-epimerase genetic sequences in many different hosts.

This will make possible the production of large quantities of one or all of the pure enzymes having this activity, while avoiding the problems of separating the enzymes from the alginate.

By inserting a high copy-number vector comprising the genetic sequences encoding the epimerase into a natural alginate producing bacterium such as Azotobacter vinelandii an enhanced production of the enzymes would be possible.

Over expression of the epimerases in a natural host could also be achieved by using a promoter which drives high-level expression of the enzymes. By blocking other genetic sequences coding for the alginate production, the production of pure enzymes may be achieved.

Yet another aspect of the invention is the selective inactivation of the mannuronan C-5-epimerase genes in the natural host organism so as to provide for bacterial production of alginates having a low content of G blocks or even a pure poly-M alginate. This is accomplished by inserting nucleotides into one, several or all of the mannuronan C-5-epimerase genes in the natural host organism Azotobacter. It is especially preferred to insert a DNA fragment encoding a selectable marker gene, preferably a gene conferring antibiotic resistance. Insertion of a selectable marker allows selection of those bacteria in which the insertion has been successfully accomplished. By choosing different selectable markers, for example providing resistance to different antibiotics, it is possible to select recombinants that have incorporated inserted sequences into some or all of the mannuronan C-5-epimerase genes. Thus, selective production of bacterial strains in which one of the mannuronan C-5-epimerase genes, several or all of them have been inactivated is possible.

A second method of inactivating all of the epimerase genes is to transform a cell of the natural host strain, Azotobacter with a vector which expresses an antisense RNA which specifically binds to mRNA transcribed from these genes. Use of promoters of varying strength to drive expression of the antisense RNA in the creation of the vectors used to transform the cells allows production of strains having varying ratios of G-blocks to M-blocks in the alginate produced. Use of inducible promoters to drive expression of the antisense RNA allows the creation of strains which can produce alginates of variable composition, depending on culture conditions. Clearly, if the recombinant host organism is the natural host, Azotobacter, it is possible to enhance production of one epimerase gene while leaving expression of the others at their normal level, thus producing a strain which makes an alginate having an altered ratio of G blocks to M blocks. A strain which makes alginate having 0-25% M blocks is preferred.

Alternatively, all but one of the epimerase genes can be inactivated, as described above, and the expression of the remaining epimerase gene can be controlled by a regulated promoter. A strain carrying such a complement of epimerase genes would thus produce alginates having a high content of G-blocks, especially from 75-98%. Another means for making a strain for producing alginates having a high G-block content is to inactivate all but one of the mannuronan C-5-epimerase genes by insertion and control the remaining gene by antisense RNA, using an inducible promoter to regulate transcription of the antisense RNA gene. A still further means for making a strain for producing alginates having a high G-block content is by inactivating all naturally occurring genes and introducing a regulated gene through a vector.

Thus the present invention also includes a process for the construction of a recombinant host cell capable of expressing mannuronan C-5-epimerase activity by transforming said host cell with a recombinant DNA expression vector that comprises: (a) a promoter and translational activating sequence that function in said host cell; and (b) a DNA sequence encoding mannuronan C-5-epimerase comprising at least a DNA block A and/or a DNA block S and/or a DNA block R, positioned for expression from said promoter and translational activity sequence.

Also the present invention encompasses a process for the bacterial production of pure poly-M alginate or tailored alginates having a lower G block content, preferably in the range from 0-25%, by blocking the DNA sequences encoding the enzymes in a natural host by insertion of a foreign genetic sequence into one, several or all genetic sequences encoding mannuronan C-5-epimerase.

Other methods for achieving the same end will be known for persons skilled in the art and are hereby included into the scope of the present invention.

A further aspect of the invention are the novel enzymes having mannuronan C-5-epimerase activity. The amino acid sequences and their degree of homology will appear from FIGS. 6-9 (SEQ ID NOS:7-10 and SEQ ID NO:1).

Also as known by a person skilled in the art, variations in the nucleotide sequence which nevertheless encode proteins having the same activity as the wild-type mannuronan C-5-epimerase are encompassed within this invention.

Variations within the amino acid sequence may also encompass deletions, substitutions and additions which do not substantially change the biological activity.

Also, it is possible to make a synthetic DNA sequence encoding a mannuronan C-5-epimerase by techniques well known in the art. See for instance, Itakura et al., Science 198:1056 (1977)! and Crea et al. (Proc. Natl. Acad. Sci. USA 75:5765 (1978)! and also U.S. Pat. Nos. 4,800,159 and 4,683,202 and also published European patent application EP-A-0258017. Synthetic enzymes may be made by incorporating different combinations of the A, R and S elements, to maintain epimerase activity. The resultant alginate composition can be varied by enzyme selection.

Materials and General Methods

Bacterial strains, plasmids, and phage. Strains, plasmids, and phages are listed in Table 1.

The bacterial strain of A. vinelandii used in these experiments, is freely available from Bj.o slashed.rn Larsen, Inst. of Biotechnology, Lab. for Marine Biochemistry, 7034 Trondheim--NTH, Norway or Svein Valla, Unigen, Center for Molecular Biology, University of Trondheim, 7005 Trondheim, Norway and has been deposited Oct. 4, 1993 at the Belgian Coordinated Collections of Microorganisms (BCCM) at the Laboratorium voor Microbiologie (LMG) at Universiteit Gent (RUG) under the accession number LMG P-14235,

K. L. Ledeganckstraat 35, B-9000 Gent. Other strains of A.vinelandii mentioned in Example 9 have the following ATCC numbers: ATCC 478, ATCC 12837 and ATCC 12518. Plasmids/strains DH5α(pHE14), JM109(pHE16), JM109(pBD1), JM109(pHE18) and SURE™(pML1) have been deposited Oct. 5, 1993 at BCCM at the Laboratorium voor Moleculaire Biologie (LMBP) (same address as LMG) and have the following accession numbers LMBP 2932, 2933, 2934, 2935 and LMBP 2936.

Growth of bacteria and phages. A. vinelandii was grown at 30° C. with shaking in a nitrogen-free medium (9.8 mM K₂ HPO₄ /KH₂ PO₄, 0.8 mM MgSO₄ 7H₂ O, 3.4 mM NaCl, 0.34 mM CaCl₂, 8.7 μM Na₂ MoO₄ 2H₂ O, 54 μM FeSO₄ 7H₂ O, 1% sorbitol, pH 7.4). E. coli was grown in LB-medium Sambrook J, Fritsch, E. F. and Maniatis T., Molecular cloning, A laboratory manual, 2nd ed., Cold Spring Harbour Laboratory Press, New York, (1989)! with shaking at 37° C. When the cells were to be used for growth of phages the LB-medium was supplemented with 2.5 mM CaCl₂, 10 mM MgCl₂, and 0.4% maltose. Phages were plated on strain Q359 on L-agar (LB-medium supplemented with 2% agar). Phage LB-medium supplemented with either 0.8% agar (titrations and gene library amplification) or 0.8% agarose (screening of gene library and preparation of phage lysates) was used for overlaying agar.

Standard recombinant DNA technology. Restriction endonuclease digestions, removal of cohesive DNA ends by using the 3' exonuclease activity of T4 DNA polymerase, ligations, agarose gel electrophoresis, and end-labelling with ³² P were performed according to standard protocols Sambrook J, Fritsch, E. F. and Maniatis T ., Molecular cloning, A laboratory manual, 2nd ed., Cold Spring Harbour Laboratory Press, New York, (1989)!. Transformations were performed as described by Chung, C. T., Niemela S. L. and Miller R. H., Proc. Natl. Acad. Sci USA, 86, 2172-2175, (1989)!, and DNA sequencing was performed according to Sanger F., Nicklen S., and Coulsom, A. R., Proc. Natl. Acad. Sci USA, 74, 4563 (1977)!.

Viscosimetric measurement. The alginate used in this experiment was obtained from Ascophyllum nodosum and had an intrinsic viscosity in 0.1 M NaCl of 17.6 dl/g at 25° C. The viscosity was determined by an Ubbelhode viscosimeter.

NMR spectroscopy. The substrate used in these analyses was a low guluronic acid-containing alginate obtained from the brown algae Ascophyllum nodosum, and was prepared as described previously Larsen, B., Proceedings of the Tenth International Seaweed Symposium, Ed: Levring, T.Gothenburg, p7-33, (1980)!. For the NMR analyses epimerase was obtained from IPTG-induced E. coli JM105 cells containing pHE5. 250 ml cell culture were harvested by centrifugation and resuspended in 20 ml of 10 mM Tris, 0.34 mM CaCl₂, pH 7.0. After ultrasonication, the solution was centrifuged at 31.000×g for 1 hour. The supernatant was stored frozen at 70° C. After thawing the supernatant was filtered through a membrane with pore size 0.22 μm, and the enzyme was further purified on a Mono Q HR515 (Pharmacia) ion exchange column. The enzyme was eluted with a 0-1 M NaCl salt gradient (in the same buffer as the applied solution), and was collected in 2 ml at approximately 0.6 M NaCl. To each of two tubes was added 0.28 ml of this enzyme solution (0.9 mg/ml total protein), 1 ml alginate (7.5 mg/ml in H₂ O), and 4.62 ml 2,3,6-trimethylpyridine buffer (see above). CaCl₂ was then added to a total reaction volume of 6 ml such that one tube contained 0.85 μM, and one contained 3.4 mM CaCl₂. After incubation at 30° C. for 20 hours Na₂ EDTA (10 mM) was added to chelate the Ca²⁺ -ions, and the solutions were then dialyzed extensively against distilled water. The dialyzed alginate solutions were freeze-dried and then dissolved in D₂ O. NMR spectroscopy of these solutions were finally performed according to Grasdalen H., Larsen B., and Smidsr.o slashed.d O., Carbohydr. Res., 68, 23-31 (1979)! (Table 4). Further analysis was carried out in a similar fashion for DH5α(pHE8), JM109 (pHE16) and JM109(pBD9). The results in Table 4 conclusively demonstrate that the enzymatic activity is mannuronan C-5-epimerase activity. This activity is expressed from a number of the plasmids showing that an entire epimerase gene/protein is not required in order to maintain epimerase activity. The epimerase activity is Ca²⁺ dependent.

EXAMPLE 1

Purification of mannuronan C-5-epimerase (1), partial amino acid sequencing and synthesis of a mixed DNA probe. The enzyme was isolated from liquid cultures of A. vinelandii essentially as described in Skjåk-Br.ae butted.k, G. and Larsen, B. Carbohydrate Research, 103, (1982) 137-140!. The cells were removed by centrifugation and the enzyme was isolated by precipitation with 30% ammoniumsulphate and followed by centrifugation for 20 min. at 10000 rpm. The supernatant was then precipitated with 50% ammonium sulphate (final concentration), and the precipitate after centrifugation was dissolved in 0.05 M imidazole/HCl (pH 6.8) containing 0.34 mM CaCl₂ and 0.5 mM dithiothreitol. This crude extract was then desalted on a prepacked column (PD-10) of Sephadex G-25 (Pharmacia) equilibrated with the same buffer. The extract was then applied on an alginate-Sepharose column. Proteins bound by non-specific interactions were eluted with 0.1 M NaCl. The epimerase was eluted as a sharp peak with 0.5 M NaCl. To make the enzyme pure enough for protein sequencing, it was dialyzed against TE-buffer overnight, freezedried, and further purified by SDS-PAGE gel electrophoresis (7.5% polyacrylamide in 25 mM Tris, 192 mM glycine, 0.1% sodium dodecyl sulphate, pH 8.3.) followed by electroblotting (in electrophoresis buffer without sodium dodecyl sulphate) onto a polyvinylidene difluoride membrane, poresize 0.45 μm (Millipore). The membrane was stained with Coomassie brilliant blue and air dried, and the protein with Mw 122 kd was cut out for N-terminal sequencing on a model 477A protein sequencing apparatus from Applied Biosystems. A DNA oligonucleotide was synthesized on the basis of the amino acid sequence information, and this oligonucleotide was used as a probe for screening of the gene library after end-labelling with ³² P by polynucleotide kinase.

EXAMPLE 2

Isolation of DNA from A. vinelandii and construction of a gene library. A. vinelandii cells were harvested and washed once in 0.9% Nacl. They were then lysed according to Hansen, J. B. and Olsen, R. H., J. Bacteriol., 135, 227-238, (1978)!, and the lysate was extracted twice with phenol and twice with chloroform. Nucleic acids were precipitated with ethanol, and the DNA was collected on a glass rod and dissolved in TE-buffer (10 mM Tris, 1 mM Na₂ EDTA, pH 7.9). Further purification was obtained by CsCl/ethidium bromide density gradient centrifugation. After removal of the ethidium bromide by isopropanol extraction, the DNA solution was dialyzed against TE buffer.

The DNA (molecular size greater than 60 kb) was subjected to partial Sau3AI digestion under conditions maximizing the generation of 15-20 kb fragments. After ethanol precipitation the DNA was dissolved in 40 μl TE buffer to give a concentration of 0.5 μg/μl. The DNA was then dephosphorylated with calf intestine phosphatase, followed by inactivation of the enzyme by incubation at 75° C. for 10 minutes in the presence of 10 nM nitrilotriacetic acid. The dephosphorylated DNA was precipitated with ethanol and dissolved in 40 μl 0.1×TE buffer.

EMBL3 vector DNA was digested with BanHI+EcoRI, followed by an isopropanol precipitation step under conditions leaving the short BamHI/EcoRI oligonucleotides in solution Frischauf A., Lehrach H., Poustka A. and Murray N., J. Mol. Biol., 170, 827-842, (1983)!. The Sau3AI-digested and dephosphorylated A. vinelandii DNA (1.75 μg) was then ligated with the BamHI/EcoRI-digested vector DNA (4.75 μg), using T4 DNA ligase in a total reaction volume of 20 μl. After ligations over night at 10° C., 10 μl of the ligation mixture was subjected to in vitro packaging in a Promega Biotech packaging system. The in vitro constructed phage particles were titrated on the E. coli strain Q359, and the library was finally amplified on Q359 in one cycle by plating on solid medium. Screening of the library was performed according to standard protocols Sambrook, J., Fritsch, E. F. and Maniatis, T., Molecular Cloning, Laboratory Manual, 2nd Ed., Cold Spring Harbour Laboratory Press, (1989)!, except that the highest stringency wash was 3.2 M tetramethylammonium chloride at 50° C. A total of 1.4×10⁵ primary recombinant phages were constructed, a library complexity far above what is required for obtaining representativity of A. vinelandii genes.

EXAMPLE 3

Measurements of epimerase activity from mannuronan C-5-epimerase (1). (5-³ H) alginate was prepared as described in Skjåk-Br.ae butted.k, G. and Larsen, B., Carbohydrate Res., 103, 133-136, (1982)!. The (5-³ H)alginate was produced by growing Azotobacter vinelandii in a medium consisting of D-Glucose (20 g), K₂ HPO₄ (1 g), MgSO₄.7 H₂ O (200 mg), FeSO₄. 7 H₂ O (50 mg), NaMoO₄. 2 H₂ O (5 mg), NH₄ OAc (2.3 g) and CaCl₂. 2 H₂ O (50 mg) diluted to one liter with water. The cells were grown at 30° C. with vigorous shaking. After 30 hours, D- 5-³ H!glucose was added to a concentration of 0.6 mg/ml (Specific activity, 0.7 μCi/mg) and the cells were allowed to grow for another 72 hours. The culture was cooled in an ice-bath, and the cells were removed by centrifugation. The supernatant solution was dialysed against 0.05 M sodium EDTA (3×5 liters) for 24 hours followed by exhaustive dialysis against distilled water. The sodium alginate was then precipitated with ethanol in the presence of 0.2% of sodium chloride. The specific activity of the label was 29 000 dpm/mg alginate. The composition of this alginate was also analyzed by NMR spectroscopy, and it was found to contain 59% mannuronic acid. Phage lysates were prepared by plating 105 phages per plate. Two ml 2,3,6-trimethylpyridine buffer (50 mM, pH 6.9) were added to each plate, and the softagarose/buffer mixture was scraped off, vortexed and centrifuged at 10,000 rpm for 10 min. The supernatant was used for incubations with (5-³ H)alginate by mixing 0.25 ml (5-³ H)alginate (2.5 mg/ml), 6,3 μl 0.1 M CaCl₂, and 1.45 ml phage lysate. The mixture was incubated at 30° C. overnight, and the alginate was precipitated by addition of 15 μl 5 M NaCl and 2 ml ethanol. After incubation at -20° C. for 30 min. the solution was centrifuged at 10.000 rpm for 30 min., and 1 ml of the supernatant was used for determination of released ³ H Skjåk-Br.ae butted.k, G. and Larsen, B., Carbohydrate Res., 103, 133-136, (1982)!. in a liquid scintillation counter.

For measurements of epimerase activity as release of ³ H in cells containing recombinant plasmids, the cell cultures were harvested by centrifugation and resuspended in 2,3,6 trimethylpyridine buffer. When IPTG (3 mM) was used for induction of the lac-promoter, the inducer was added to exponentially growing cells and incubations were continued for 3 hours. Cells were disrupted by ultra-sonication, and varying amounts of the lysates were incubated with shaking together with 100 μl (5-³ H) alginate (2.5 mg/ml) and 400 μl 2,3,6-trimethylpyridine buffer (total volume 0.6 ml) in the presence of 3.3 mM CaCl₂. The quantities of enzyme-containing cell extracts used were adjusted such that the measurements were performed under conditions where the enzyme represented the limiting factor. After incubation at 30° C. at the times indicated in each case, the mixtures were precipitated with ethanol under the conditions described above for phage lysates and 1.0 ml of the supernatant was used for scintillation counting. Controls (using the appropriate host with the pUC18 vector) gave low backgrounds and these numbers were subtracted in the values presented in Table 3.

EXAMPLE 4

Molecular cloning of a DNA fragment expressing a mannuronan C-5-epimerase activity in E. coli. The A. vinelandii gene library was constructed by cloning partially Sau3AI-digested A. vinelandii DNA into the bacteriophage λ vector EMBL3. In order to identify the epimerase gene in this library, we constructed a DNA probe based on the assumption that the previously purified 122 kd protein represented the epimerase Skjåk-Br.ae butted.k, G. and Larsen, B., Carbohydr. Res., 103, 137-149 (1982)!. Initially we tried to use the corresponding protein solution for determination of the N-terminal amino acid sequence of the 122 kd protein, but the results showed that the preparation was not sufficiently pure for this purpose. We therefore purified the protein further by SDS-polyacrylamide gel-electrophoresis, followed by electroblotting onto a membrane. The band containing the 122 kd protein was cut out from this membrane and subjected to N-terminal amino acid sequence analysis. Based on parts of this sequence, we synthesized the mixed DNA probe shown in FIG. 1 (SEQ ID NO:6).

The DNA probe synthesized as in Example 1 was labelled with ³² P and then used for screening of the A. vinelandii gene library. Clones which hybridized reproducibly against the labelled probe were identified at a frequency of approximately 10⁻³, and six such clones were selected for further studies. Phage lysates were prepared from each of the six clones, and each lysate was assayed for epimerase activity (Table 2). As can be seen, the lysates prepared from all six clones appeared to contain a weak enzyme activity that could represent the epimerase. This conclusion was further supported by the observation that control lysates prepared from randomly picked recombinant phages in the library, gave reproducibly lower activity, representing the background activity.

EXAMPLE 5

Subcloning of a DNA fragments encoding the epimerase. DNA from phage EP2 was partially digested with Sau3AI, and fragments ranging from 4 to 9 kb in size were subcloned in the BamHI site of plasmid pUC18. Cell extracts from DH5α transformants containing recombinant plasmids were assayed for epimerase activity, and the corresponding plasmids were also hybridized against the synthetic oligonucleotide used for screening of the gene library. The analysis of the cell extracts showed that one of them contained an enzymatic activity consistent with the assumption that a polypeptide having epimerase activity was expressed from the plasmid (pHE1) in this clone (see Table 3). We have also tried to centrifuge the extract at 30000 g for 3.5 hours, and found that the activity was not significantly reduced in the supernatant. Since we were unable to detect any significant activity in the culture medium, we conclude that the epimerase is localized intracellularly in E.coli. The insert in pHE1 also hybridized against the synthetic oligonucleotide used for screening, and pHE1 was therefore selected for further analysis.

EXAMPLE 6

Characterisation of the cloned DNA required for expression of the epimerase, and stability of the enzyme in vivo and in vitro. The insert in pHE1 is approximately 4 kb in size, and FIG. 2 shows the restriction map of this insert. Hybridization analysis of pHE1 with the original synthetic oligonucleotide showed that the sequence hybridizing to the oligonucleotide was localized downstream of the SphI site. The hybridizing sequence was further characterized by DNA sequencing, and this analysis showed that one of the potential reading frames of the sequence was in 100% agreement with the original N-terminal amino acid sequence of the 122 kd protein. Surprisingly, however, the orientation of the sequence was such that it would be transcribed out of the cloned fragment (see FIG. 2). This result thus indicated that the observed epimerase activity was not correlated with the sequence encoding the 122 kd protein, a conclusion that was further confirmed by the observation that the terminal 0.5 kb SphI fragment could be deleted (generating plasmid pHE7) without loss of the epimerase activity from the corresponding cell extract. In addition to the SphI deletion, we deleted (from pHE7) the 0.7 kb KpnI fragment at the opposite terminus of the insert, generating plasmid pHE5. As shown in Table 3, pHE5 (in DH5α) expressed the epimerase at a level approximately 27 times higher than the level of expression from pHE1.

During the expression studies described above we found that the measurements were quantitatively difficult to reproduce unless the time of harvesting the cells were kept as constant as possible. We have analyzed this problem more carefully by measuring the enzyme activity at different stages of growth of the E. coli cells. The results of such an analysis are shown in FIG. 3, and indicate that the enzymatic activities in the cell extracts are drastically reduced shortly after the cells have entered the stationary phase. To obtain optimal enzyme yields it is therefore important to harvest the cells at the end of the exponential phase or at the beginning of the stationary phase. The reason for the reduction of epimerase activity might potentially be due to proteolysis of the epimerase in stationary phase cells. To study the stability of the enzyme in vitro we have also analyzed the kinetics of ³ H release in the DH5α(pHE5) extracts. As can be seen from FIG. 4, the enzyme activity is linear over at least 30 hours, demonstrating that the enzyme is very stable in vitro. A critical parameter for obtaining reproducible results is thus the time of harvesting of the cells.

EXAMPLE 7

Stimulation of the epimerase activity by induction of the lac promoter. The results described above showed that the levels of expression of the epimerase from pHE5 was significantly higher than in pHE1. The reasons for this could potentially be that the lac promoter was important for the expression, and we have therefore analysed this problem more closely. The analyses were performed in the E. coli strain JM105, a strain which expresses high levels of lac repressor, thus allowing a more repressed state of the promoter under uninduced conditions. When cell extracts prepared from uninduced and induced (with IPTG) cells of JM105(pHE1), a significant stimulation of enzyme activity was observed in the induced cells (Table 3). A similar experiment using JM105(pHE5) showed even greater stimulation of the expression of the epimerase upon addition of IPTG in this case. These experiments thus showed that the lac promoter probably is a key element, although not necessarily the only element, involved in the expression of the epimerase from pHE5. The experiments in addition showed that the direction of transcription is from the KpnI towards the SphI site in the insert. The epimerase gene is therefore transcribed in the same direction as the gene encoding the 122 kd protein, whose N-terminal amino acid sequence was used for the isolation of the cloned DNA.

Preliminary experiments on deleting more DNA from the SphI side of the insert indicated that very little could be deleted without loss of the epimerase activity. At the KpnI side, on the other hand, we found that significant deletions were tolerated. Table 3 shows the results of analysis of expression of the epimerase from a plasmid (pHE8) constructed by deleting the 0.8 kb KpnI/SacII fragment from pHE5. As can be seen, this deletion resulted in a very strong stimulation of the epimerase activity both in uninduced and induced cells. The expression from pHE8 is presumably based on initiation of translation from the Shine-Dalgarno sequence in the vector (localized between the lac promoter and the polylinker). Similarly, high levels of expression were obtained from pHE22 also due to the coding sequences being in frame with the Shine-Dalgano sequence. So far we have not obtained expression of the epimerase in constructs where deletions have extended beyond the SacII site.

EXAMPLE 8

Use of a different promotor than the lac promotor. The insert in pHE5 (EcoRI-HindIII) was sub-cloned into plasmid pT7-3 (a derivative of pT7-1 described by Tabor, S., and C. C. Richardson (1985). Proc. Natl. Acad. Sci. 82, 1074-1078!), and the new plasmid was designated pLB1. The insert in pLB1 is localized downstream of the φ10 promoter in the vector. This promoter is only recognized by the bacteriophage T7 RNA polymerase, and expression of genes downstream of this promoter thereby becomes dependent on expression of this polymerase activity in the cells. The 442 bp (see FIG. 2) SacI- SpoI fragment was finally deleted from the insert in pLB1, generating plasmid pLB2. pLB2 was transformed into E. coli K38 (pGP1-2). Plasmid pGP1-2 encodes the gene for T7 RNA polymerase, and the expression of the gene is controlled by a temperature inducible repressor. K38(pLB1, pGP1-2) was grown in exponential phase at 30° C. for 41/2 hours. One of two parallel cell cultures was then transferred to 42° C. for 30 minutes to induce the T7-polymerase. The other parallel cell culture was grown at 30° C. for 5 hours. The epimerase activities in the cells were measured as described in example 3, and the results of the measurements are shown in Table 3.

EXAMPLE 9

Cloning of mannuronan C-5-epimerase (2). Plasmid pHE12 was constructed by inserting a 6.2 kb XhoI fragment from the recombinant bacteriophage lambda derivative EP2 into pUC128. As can be seen from FIG. 2 the insert in pHE12 is partly overlapping with the insert in pHE1. Analysis of extracts prepared from cells containing pHE12 (as described for pHE1), showed that they expressed mannuronan C-5-epimerase activity (Table 3). Further analysis showed that the 2.5 kb SpoI-XhoI fragment could be deleted from the insert in pHE12 without affecting the expression of mannuronan C-5-epimerase. Further plasmids were constructed (see FIG. 2) and the activity analysed (see Table 3). This demonstrated that both the genes and gene fragments were able to express epimerase activity. The nucleotide sequences of the inserts were determined by the method of Sanger Sanger, F., S. Nicklen, and A. R. Coulson. 1977. Proc. Natl. Acad. Sci. 74, 5436!. The nucleotide sequences are shown in FIG. 6.

EXAMPLE 10

Sequence Comparison. Five genes have been identified as shown in FIG. 2. The insert containing E5 is located about 5-10 kilobases away from the other genes. SEQ ID NO:1 shows the nucleotide sequence for the complete genes of E4, E1, E2 and a large portion of E3. SEQ ID NO:2-SEQ. ID NO:5 sequentially show the amino acid sequences for E4, E1, E2 and a large portion of E3. Detailed analysis of the nucleotide and amino acid sequences revealed highly homologous regions within each gene and between the various genes. FIG. 5 characterises each of the genes by reference to the homologous blocks. Each of the genes has at least one A-element and at least one R-element.

E1, E2 and E4 all end with a reasonably homologous sequence termed the S-element (not shown in FIG. 5). The last 14 amino acids of the S-element of E1 and E2 are identical with one exception. FIGS. 6-9 show detailed analysis of the A- and R-elements within each gene by reference to the consensus sequence (con) (SEQ ID NO:7-SEQ ID NO:10). Each A-element is approximately 1,150 base pairs long and each R-element is approximately 450 base pairs long. Short oligonucleotides are present in E1, E2 and E3 between the second and third R-elements. Gaps have been introduced where necessary to maximise alignment (see in particular the third R-element of E2).

Hybridization with a probe made from the first part of the A-element to a Southern blot of A. vinelandii digested with restriction endonuclease Bg1II gave 5 distinct bands. One of these bands contained two A blocks, and another of these bands contained two different fragments with the same size. The number of bands were the same when other strains (ATCC 478, ATCC 12837 and ATCC 12518) of the same species were used. This implies that the bacterium contains at least 5 copies of the A-element, and that this is common to several independently isolated strains of A. vinelandii.

The first part of each R-element contains six perfect and imperfect repeats of a nonapeptide with the consensus sequence LXGGAGXDX (SEQ ID NO:11), except for the third R-element of E2 which lacks two of these repeats. FIG. 11 shows the complete nucleotide (SEQ ID NO:1) and corresponding amino acid sequence of E2. The nonapeptides have been marked with double lines for a good match with the consensus sequence and single lines for less good matches. This nonapeptide motif is characteristic of the haemolysin family of secreted proteins (Suh, Y. and Benedik, M. J., J. Bacteriol 174, (1992) 2361-2366). These proteins are all calcium dependent and are secreted by a pathway which does not involve cleavage of an N terminal signal peptide. For haemolysin secreted from E. coli it has been proposed that the nonamers are responsible for the binding of calcium ions (Ludwig, A. et al, Mol. Gen. Genet. 214, (1988) 553-561, Boehm, D. F. et al, Infect. Immun. 58 (1990) 1959-1964). It appears that the R-elements are involved in calcium ion binding, calcium being necessary for both enzyme activity and gel formation.

EXAMPLE 11

Making an altered epimerase.

As can be seen from Table 3, various elements may be deleted from the gene, while maintaining the expression of a protein having epimerase activity. Clearly, the latter portion of E1 having the sequence ARS has epimerase activity (see plasmid pHE8), although deletions in A2 of E1 are not tolerated (see Example 7). Additionally, E2 having the sequence ARRRRS (SEQ ID NO:12) also demonstrates epimerase activity. Additionally, fragments of E3 lacking a carboxy terminal and having the sequences ARRR (SEQ ID NO:15) and ARRRARR (SEQ ID NO:13) have expressed epimerase activity. (See plasmids pH18 and pBD6). Accordingly, it appears that an S-element is not essential for epimerase activity, although the presence of this element may affect activity. We therefore postulated that an epimerase may need at least one A-element and at least one R-element, and that it should be possible to make altered epimerases by combining these elements in different ways. To show this, we constructed a plasmid encoding an epimerase with the sequence RARS (SEQ ID NO:14):

The insert in pHE1 (EcoRI-HindIII) was subcloned into plasmid pTrc99A (Pharmacia), generating plasmid pHE21. This plasmid contains a trc-promoter in front of the epimerase I gene, a strong transcription termination signal downstream of the gene, and the lacI^(q) -gene allowing induction with IPTG. pHE21 was digested with KpnI and SpoI, made blunt-ended with S1 nuclase and religated. The resulting plasmid, pHE22, expresses a protein having the carboxy terminal of epimerase 1, RARS. The epimerase activity was measured as in Example 3, see Table 3.

Given that epimerase activity is expressed from a number of the constructs, it seems likely that a number of synthetic enzymes may be produced having epimerase activity including differing numbers of A, R and S blocks. The presence of activity in pHE22 implies that it is not essential to have an amino-terminal A block, and so block order may also be altered.

EXAMPLE 12

The ¹ H-NMR spectra of alginate epimerased by extracts from plasmids pHE8 and pBD9 show that the proteins encoded by these plasmids have different enzyme activity, pHE8 producing epimerase with single G activity while pBD9 producing epimerase with G block activity. pHE8 encodes the carboxy terminal ARS of E1 whereas pBD9 encodes ARRRRS (SEQ ID NO:12) of E2. The naturally encoded epimerases may therefore have differing activity particularly in the distribution patterns of Gs. The different activity of the various epimerases encoded within the 5 genes could be used to create alignates having a desired structure, by selectively expressing a desired gene or genes. Alternatively, it may be possible to construct synthetic enzymes varying the A, R and S block content of each epimerase to provide enzymes having altered activity providing a further level of control in the production of desired alginates.

                  TABLE 1     ______________________________________     Bacterial strains, phages, and plasmids     Strain/phage/     plasmid Remarks             References     ______________________________________     Bacterial     strains     A. vinelandii             Strain E            Larsen and Haug                                 (1971)     E. coli     Q359    supE hsdR φ80.sup..left brkt-top. P2                                 Karn et al. (1980)     DH5α             supE44ΔlacU169 (φ80 lacZΔM15)                                 Bethesda Research             hsdR17 recA1 endA1 gyrA96                                 Laboratories (1986)             thi-1 relA1     JM105   supE endA sbcB15 hsdR4 rpsL                                 Yanisch-Perron             thiΔ(lac--proAB)                                 et al (1985)     JM109   recA1 supE44 endA1 hsdR17                                 Yanisch-Perron             gyrA96 relA1 thiΔ(lac--proAB)                                 et al (1985)             F traD36 proAB .sup.+ lacI.sup.q lacZΔM15!     SURE ™             e14.sup.- (mcrA),   Greener (1990)             Δ(mcrCB--hsdSMR--mrr)             171, endA1, supE44, thi-1,             gyrA96, relA1, lac, recB, recJ,             sbcC, umuC:Tn5(kan.sup..left brkt-top.), uvrC,  F'             proAB, lacI.sup.q ZΔM15, Tn10, (tet.sup..left brkt-top.)!.     Phages     EMBL3   Bacteriophage λ vector used for                                 Frischauf et al.             construction of A. vinelandii                                 (1983)             gene library     EPx     Randomly picked phage from                                 See examples             A. vinelandii gene library     EP2, -3,             Phages isolated from A.                                 See examples     -6, -7, vinelandii gene library     -8 and -9             Identified by hybridization and             expresses mannuronan C-5-epimerase     Plasmids     pUC18   Ampicillin resistance, ColE1                                 Norrander et al.             replicon     pUC128  Ampicillin resistance, ColE1                                 Keen et al. (1988)             replicon     pTrc99A Ampicillin resistance, ColE1                                 Pharmacia             replicon     pT7-3   Ampicillin resistance, ColE1                                 Tabor & Richardson             replicon            (1985)     pGP1-2  Kanamycin resistance, P15A                                 Tabor & Richardson             replicon            (1985)     pBluescript             Ampicillin resistance, ColE1     II SK(+)             replicon     pHE1    Derivative of pUC18 where a                                 See examples             4 kb Sau3A1 DNA fragment from             phage EP2 was subcloned into the             BamH1 polylinker     pHE7    Derivative of pHE1 where a 0.5 kb                                 See examples             Sph1 DNA fragment was deleted     pHE5    Derivative of pHE7 where a 0.7 kb                                 See examples             Kpn1 DNA fragment was deleted     pHE8    Derivative of pHE5 where a 0.8 kb                                 See examples             Kpn1/SacII DNA fragment was             deleted. Cohesive ends were             removed prior to ligation by             using the 3' exonuclease activity             of T4 DNA polymerase     pLB1    Derivative of pT7-3, where the                                 See examples             2,7 kb insert from pHE5 was cloned             into the EcoRI/HindIII polylinker     pLB2    Derivative of pLB1, where a 0.4 kb                                 See examples             SacI/SpoI fragment was deleted     pHE12   Derivative of pUC128, where a                                 See examples             6.2 kb XhoI fragment from phage             EP2 was subcloned into the XhoI             polylinker     pBD1    Derivative of pHE12, where a                                 See examples             2.0 kb SpoI/NsiI fragment was             deleted     pHE21   Derivative of pTrc99A, where the                                 See examples             4.0 kb insert of pHE1 was cloned             into the EcoRI/HindIII polylinker     pHE22   Derivative of pHE21, where a                                 See examples             1.2 kb KpnI/SpoI fragment was             deleted     pHE2    Derivative of pUC18 where a                                 See examples             6.0 kb SphI DNA fragment from             phage EP6 was cloned into the             SphI site in the polylinker of             the vector     pHE16   Derivative of pHE2 where a                                 See examples             1.5 kb EcoRI--SmaI DNA fragment             was deleted     pBD9    Derivative of pBD1 where a                                 See examples             0.4 kb XhoI--FspI DNA fragment             was deleted     pBD6    Derivative of pHE12 where a                                 See examples             3.4 kb XhoI--EspI DNA fragment             was deleted     pHE18   Derivative of pUC128 where an                                 See examples             5.1 kb NotI--PvuII DNA fragment             from EP6 was cloned into the             NotI--EcoRV sites in the polylinker             of the vector     pHE14   Derivative of pUC128 where a                                 See examples             3.0 kb BglII DNA fragment from             EP6 was cloned into the BamHI             site of the polylinker     pML1    Derivative of pBluescript II                                 See examples             SK(+) where a 4.3 kb KpnI--SacII             DNA fragment was cloned into             the corresponding sites in the             polylinker     ______________________________________

References to Table 1

Larsen, B., and A. Haug. 1971. Biosynthesis of alginate. Carbohydr. Res. 17:287-296.

Karn, J., S. Brenner., L. Barnett, and G. Cesareni. 1980. Proc. Natl. Acad. Sci. U.S.A. 77:5172.

Bethesda Research Laboratories. 1986. Bethesda Res. Lab. Focus 8(2):9.

Yanisch-Perron, C., J. Vieira, and J. Messing. 1985. Gene 33:103-119.

Frischauf, A., H. Lehrach, A. Poustka, and N. Murray. 1983. J. Mol. Biol. 170:827-842.

Norrander, J., T. Kempe, J. Messing. 1983. Gene 26:101-106.

Keen N. T, S. Tamaki, D. Kobayashi and D. Trollinger (1988). Gene 79:191-197.

Tabor S. and C. C. Richardson (1985). Proc Natl Acad Sci USA 82:10741078.

Greener, A. (1990) Strategies 3:5-6.

                  TABLE 2     ______________________________________     Putative mannuronan C-5-epimerase activity     in recombinant phage lysates.     Recombinant phage                     .sup.3 H release (dpm)     ______________________________________     EPx             39     EP2             91     EP3             107     EP6             74     EP7             75     EP8             245     EP9             75     ______________________________________      EPx originated from the A. vinelandii gene library as a randomly picked      plaque, while the other six phages were selected on the basis of the      hybridization between their DNA and the labelled oligonucleotide used for      screening of the library.

                  TABLE 3     ______________________________________     Activity of the mannuronan C-5-epimerase     expressed from the plasmids.                        Released .sup.3 H/0D.sub.600                        unit cell culture     Enzyme    Strain         No IPTG  IPTG     ______________________________________     Epimerase 1.sup.1               JM109 (pHE16)  10000    11000     Epimerase 1.sup.2               DH5α (pHE1)                               273     nd     Epimerase 1.sup.3               DH5α (pHE5)                              9700     nd     Epimerase 1.sup.4               JM105 (pHE1)    637     2800     Epimerase 1.sup.5               JM105 (pHE5)   4800     28500     Epimerase 1.sup.6               JM105 (pHE8)   58900    181000     Epimerase 1.sup.7               JM109 (pHE21)   93      1283     Epimerase 1.sup.9               JM109 (pHE22)  5383     34611     Epimerase 1.sup.9               K38 (pGP1-2, pLB2)*                              2150     8333     Epimerase 2.sup.1               JM109 (pHE12)  nd        140     Epimerase 2.sup.2               JM109 (pBD9)   nd       6700     Epimerase 3.sup.1               JM109 (pHE18)   551     2270     Epimerase 3.sup.2               JM109 (pBD6)   nd        530     Epimerase 4               DH5α (pHE14)                              nd       3500     ______________________________________      The extracts were incubated with the alginate for 16 hours, and the      numbers are given in dpm. nd = not determined.      *Thge culture was not induced by IPTG, but by raising the temperature fro      30° C. to 42° C.

                  TABLE 4     ______________________________________     NMR analysis of the reaction product after incubation of the     recombinant epimerase with a low guluronic acid containing substrate             CaCl.sub.2                   Frequences of M and G residues     Strain    (mM)    F.sub.M                              F.sub.G                                   F.sub.MM                                        F.sub.MG                                             F.sub.GG                                                  F.sub.GG /F.sub.G     ______________________________________     JM105(pHE5)               0.85    0.80   0.20 0.66 0.14 0.06 0.3     JM105(pHE5)               3.4     0.71   0.29 0.45 0.26 0.03 0.10     DH5α(pHE8)               3.4     0.72   0.28 0.52 0.20 0.08 0.29     JM109(pHE16)*               3.4     0.82   0.18 0.79 0.03 0.15 0.83     JM109(pBD9)               2.1     0.60   0.40 0.54 0.06 0.34 0.85     ______________________________________      *This spectrum was obtained at a 400 MHz instrument to be able to get      relatively correct figures in spite of the lower conversion.

    __________________________________________________________________________     #             SEQUENCE LISTING     - (1) GENERAL INFORMATION:     -    (iii) NUMBER OF SEQUENCES: 52     - (2) INFORMATION FOR SEQ ID NO:1:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 12588 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: DNA (genomic)     -     (vi) ORIGINAL SOURCE:     #vinelandiiA) ORGANISM: Azotobacter               (B) STRAIN: E     -     (ix) FEATURE:               (A) NAME/KEY: CDS               (B) LOCATION: 290..1951     -     (ix) FEATURE:               (A) NAME/KEY: CDS               (B) LOCATION: 2227..6438     -     (ix) FEATURE:               (A) NAME/KEY: CDS               (B) LOCATION: 6702..9695     -     (ix) FEATURE:               (A) NAME/KEY: CDS               (B) LOCATION: 9973..12588     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:     - GATCCGGCCG TCTGAGACGG CGCCTCCGGC CGTCGGCGAG TGCGCCGTTC GC - #CGACGGCC       60     - GGGCGAACGG ATGAGGACTG CTCCACTCTC ACCCAGATAA GCGCGTGGGC CG - #TTTCATCC      120     - GAGCGCCTTT CCGGGCCGCT TCGAAAGACC GCCACGAGGC ACTCTGTGCA AG - #GGCCAGGC      180     - AGTCGCGTTG CAACCGGAGA CGGGACCGGC CCGTTCGGGC GTCGTCTCTT CC - #CGCTCCAC      240     - TTTTTCCAGG CAGCTTCGGC TGCTCCACTC GGAACCGGGA AGCGGAGAT ATG - # GAT      295     #Met Asp     #  1     - TAC AAC GTC AAG GAT TTC GGT GCA TTG GGC GA - #C GGC GTC AGC GAC GAC      343     Tyr Asn Val Lys Asp Phe Gly Ala Leu Gly As - #p Gly Val Ser Asp Asp     #          15     - CGG GCC TCC ATC CAG GCG GCG ATC GAT GCC GC - #C TAC GCC GCC GGT GGC      391     Arg Ala Ser Ile Gln Ala Ala Ile Asp Ala Al - #a Tyr Ala Ala Gly Gly     #     30     - GGT ACC GTC TAC CTG CCG GCC GGC GAG TAC CG - #G GTC AGC GCC GCC GGG      439     Gly Thr Val Tyr Leu Pro Ala Gly Glu Tyr Ar - #g Val Ser Ala Ala Gly     # 50     - GAG CCG GGC GAC GGC TGC CTG ATG CTC AAG GA - #C GGC GTC TAC CTG GCC      487     Glu Pro Gly Asp Gly Cys Leu Met Leu Lys As - #p Gly Val Tyr Leu Ala     #                 65     - GGT GCC GGC ATG GGC GAG ACG GTG ATC AAG CT - #G ATC GAC GGC TCC GAC      535     Gly Ala Gly Met Gly Glu Thr Val Ile Lys Le - #u Ile Asp Gly Ser Asp     #             80     - CAG AAG ATC ACC GGC ATG GTC CGC TCG GCC TA - #C GGC GAG GAA ACC AGC      583     Gln Lys Ile Thr Gly Met Val Arg Ser Ala Ty - #r Gly Glu Glu Thr Ser     #         95     - AAC TTC GGC ATG CGC GAC CTG ACC CTC GAC GG - #C AAC CGC GAC AAC ACC      631     Asn Phe Gly Met Arg Asp Leu Thr Leu Asp Gl - #y Asn Arg Asp Asn Thr     #   110     - AGC GGC AAG GTC GAC GGC TGG TTC AAC GGC TA - #T ATC CCC GGC GGG GAC      679     Ser Gly Lys Val Asp Gly Trp Phe Asn Gly Ty - #r Ile Pro Gly Gly Asp     115                 1 - #20                 1 - #25                 1 -     #30     - GGC GCC GAC CGC GAC GTG ACC ATC GAG CGG GT - #G GAG GTC CGC GAG ATG      727     Gly Ala Asp Arg Asp Val Thr Ile Glu Arg Va - #l Glu Val Arg Glu Met     #               145     - TCC GGC TAC GGC TTC GAC CCC CAC GAG CAG AC - #C ATC AAC CTG ACG ATC      775     Ser Gly Tyr Gly Phe Asp Pro His Glu Gln Th - #r Ile Asn Leu Thr Ile     #           160     - CGC GAC AGC GTG GCC CAC GAC AAC GGC CTC GA - #C GGC TTC GTC GCC GAC      823     Arg Asp Ser Val Ala His Asp Asn Gly Leu As - #p Gly Phe Val Ala Asp     #       175     - TAC CTG GTC GAC AGC GTG TTC GAG AAC AAC GT - #C GCC TAC GCC AAC GAC      871     Tyr Leu Val Asp Ser Val Phe Glu Asn Asn Va - #l Ala Tyr Ala Asn Asp     #   190     - CGC CAC GGC TTC AAC GTG GTC ACC AGC ACC CA - #C GAT TTC GTC ATG ACC      919     Arg His Gly Phe Asn Val Val Thr Ser Thr Hi - #s Asp Phe Val Met Thr     195                 2 - #00                 2 - #05                 2 -     #10     - AAC AAC GTC GCC TAC GGC AAC GGC AGC AGC GG - #C CTG GTG GTG CAG CGG      967     Asn Asn Val Ala Tyr Gly Asn Gly Ser Ser Gl - #y Leu Val Val Gln Arg     #               225     - GGT CTG GAG GAC CTC GCG CTG CCC AGC AAC AT - #C CTG ATC GAC GGC GGC     1015     Gly Leu Glu Asp Leu Ala Leu Pro Ser Asn Il - #e Leu Ile Asp Gly Gly     #           240     - GCC TAC TAC GAC AAC GCC CGC GAA GGC GTG CT - #G CTC AAG ATG ACC AGC     1063     Ala Tyr Tyr Asp Asn Ala Arg Glu Gly Val Le - #u Leu Lys Met Thr Ser     #       255     - GAC ATC ACC CTG CAG AAC GCC GAT ATC CAC GG - #C AAC GGC TCC TCC GGG     1111     Asp Ile Thr Leu Gln Asn Ala Asp Ile His Gl - #y Asn Gly Ser Ser Gly     #   270     - GTG CGC GTC TAC GGC GCC CAG GAC GTG CAG AT - #C CTC GAT AAC CAG ATC     1159     Val Arg Val Tyr Gly Ala Gln Asp Val Gln Il - #e Leu Asp Asn Gln Ile     275                 2 - #80                 2 - #85                 2 -     #90     - CAC GAC AAC GCG CAG GCG GCC GCC GTG CCC GA - #G GTC CTG CTG CAG TCC     1207     His Asp Asn Ala Gln Ala Ala Ala Val Pro Gl - #u Val Leu Leu Gln Ser     #               305     - TTC GAC GAT ACC GCC GGG GCG TCC GGC ACC TA - #C TAC ACG ACC CTG AAC     1255     Phe Asp Asp Thr Ala Gly Ala Ser Gly Thr Ty - #r Tyr Thr Thr Leu Asn     #           320     - ACC CGG ATC GAG GGC AAC ACC ATC AGC GGC TC - #G GCC AAC TCC ACC TAC     1303     Thr Arg Ile Glu Gly Asn Thr Ile Ser Gly Se - #r Ala Asn Ser Thr Tyr     #       335     - GGC ATC CAG GAG CGC AAC GAC GGC ACC GAC TA - #C AGC AGC CTG ATC GAC     1351     Gly Ile Gln Glu Arg Asn Asp Gly Thr Asp Ty - #r Ser Ser Leu Ile Asp     #   350     - AAC GAC ATC GCC GGG GTG CAA CAG CCC ATC CA - #A CTG TAC GGA CCT CAC     1399     Asn Asp Ile Ala Gly Val Gln Gln Pro Ile Gl - #n Leu Tyr Gly Pro His     355                 3 - #60                 3 - #65                 3 -     #70     - TCG ACG GTA TCC GGC GAA CCC GGC GCG ACA CC - #G CAA CAG CCG TCC ACG     1447     Ser Thr Val Ser Gly Glu Pro Gly Ala Thr Pr - #o Gln Gln Pro Ser Thr     #               385     - GGA AGC GAC GGC GAG CCA CTG GTC GGC GGC GA - #C ACG GAC GAC CAG CTC     1495     Gly Ser Asp Gly Glu Pro Leu Val Gly Gly As - #p Thr Asp Asp Gln Leu     #           400     - CAG GGC GGC TCC GGC GCC GAT CGC CTG GAC GG - #C GGG GCC GGC GAC GAC     1543     Gln Gly Gly Ser Gly Ala Asp Arg Leu Asp Gl - #y Gly Ala Gly Asp Asp     #       415     - ATC CTC GAC GGC GGC GCC GGG CGC GAC CGG CT - #G AGC GGC GGC GCG GGC     1591     Ile Leu Asp Gly Gly Ala Gly Arg Asp Arg Le - #u Ser Gly Gly Ala Gly     #   430     - GCC GAC ACC TTC GTG TTC TCC GCC CGC GAG GA - #C AGC TAC CGT ACC GAC     1639     Ala Asp Thr Phe Val Phe Ser Ala Arg Glu As - #p Ser Tyr Arg Thr Asp     435                 4 - #40                 4 - #45                 4 -     #50     - ACG GCG GTG TTC AAC GAC CTG ATC CTC GAC TT - #C GAG GCC AGC GAG GAT     1687     Thr Ala Val Phe Asn Asp Leu Ile Leu Asp Ph - #e Glu Ala Ser Glu Asp     #               465     - CGC ATC GAC CTG TCC GCG CTG GGC TTT TCC GG - #C CTG GGC GAC GGC TAT     1735     Arg Ile Asp Leu Ser Ala Leu Gly Phe Ser Gl - #y Leu Gly Asp Gly Tyr     #           480     - GGC GGC ACC CTG CTC CTG AAG ACC AAC GCC GA - #G GGC ACG CGC ACC TAC     1783     Gly Gly Thr Leu Leu Leu Lys Thr Asn Ala Gl - #u Gly Thr Arg Thr Tyr     #       495     - CTG AAA AGC TTC GAG GCG GAT GCC GAG GGA CG - #G CGC TTC GAG GTC GCC     1831     Leu Lys Ser Phe Glu Ala Asp Ala Glu Gly Ar - #g Arg Phe Glu Val Ala     #   510     - CTG GAC GGC GAC CAC ACG GGC GAT CTT TCC GC - #C GCC AAT GTG GTC TTC     1879     Leu Asp Gly Asp His Thr Gly Asp Leu Ser Al - #a Ala Asn Val Val Phe     515                 5 - #20                 5 - #25                 5 -     #30     - GCC GCG ACC GGG ACG ACC ACC GAA CTC GAA GT - #G CTC GGC GAC AGC GGC     1927     Ala Ala Thr Gly Thr Thr Thr Glu Leu Glu Va - #l Leu Gly Asp Ser Gly     #               545     - ACG CAG GCC GGG GCG ATC GTC TAG CGCGTCCCGC TC - #CGACACAT AGCCGGTCGT     1981     Thr Gln Ala Gly Ala Ile Val  *                 550     - CGGCAAGGCG GCCGGCCGCC GGCTGCCCCG AAGTTTCCAA TCTAATCTCA CC - #TACAGACA     2041     - GGCGCGTTCC GGTGCGCCCG AGCGCCGCCC CCGGGAACGA CCGGCAGGGC GT - #GTTTGTGC     2101     - GCAAGGTGCA GGCGGTCGCG CTCGAAGCCA GAGGCAGGGA AAACCTTTTC CG - #GCAGTCGT     2161     - CTCTTCCTTC TCCACTTCCC AGGCAGCCCT GGGCCGAGCA ACACGACGGG AT - #TAGGAAGC     2221     #GCA CTG GGC GAT GGC       2268 GAT TTC GGA     #Phe Gly Ala Leu Gly Asp Glys Asp     #     10     - GTC AGC GAC GAC ACG GCG GCC ATC CAG GCG GC - #G ATC GAC GCC GCC CAC     2316     Val Ser Asp Asp Thr Ala Ala Ile Gln Ala Al - #a Ile Asp Ala Ala His     # 30     - GCG GCG GGC GGC GGC ACC GTC TAC CTG CCG GC - #C GGC GAA TAT CGG GTC     2364     Ala Ala Gly Gly Gly Thr Val Tyr Leu Pro Al - #a Gly Glu Tyr Arg Val     #                 45     - AGC GGC GGC GAG GAG CCT TCC GAT GGT TGT CT - #G ACC ATC AAG AGC AAC     2412     Ser Gly Gly Glu Glu Pro Ser Asp Gly Cys Le - #u Thr Ile Lys Ser Asn     #             60     - GTC CAT ATC GTC GGC GCC GGG ATG GGC GAG AC - #G GTG ATC AAG ATG GTC     2460     Val His Ile Val Gly Ala Gly Met Gly Glu Th - #r Val Ile Lys Met Val     #         75     - GAC GGC TGG ACG CAG AAC GTC ACC GGC ATG GT - #G CGC TCG GCC TAC GGC     2508     Asp Gly Trp Thr Gln Asn Val Thr Gly Met Va - #l Arg Ser Ala Tyr Gly     #     90     - GAG GAA ACC AGC AAC TTC GGC ATG AGC GAC CT - #G ACC CTC GAC GGC AAC     2556     Glu Glu Thr Ser Asn Phe Gly Met Ser Asp Le - #u Thr Leu Asp Gly Asn     #110     - CGC GAC AAC CTG TCC GCC AAG GTC GAC GGC TG - #G TTC AAC GGC TAC ATC     2604     Arg Asp Asn Leu Ser Ala Lys Val Asp Gly Tr - #p Phe Asn Gly Tyr Ile     #               125     - CCC GGC CAG GAC GGC GCC GAT CGC GAC GTG AC - #C CTG GAG CGG GTG GAA     2652     Pro Gly Gln Asp Gly Ala Asp Arg Asp Val Th - #r Leu Glu Arg Val Glu     #           140     - ATC CGC GAG ATG TCC GGC TAC GGT TTC GAC CC - #C CAC GAG CAG ACC ATC     2700     Ile Arg Glu Met Ser Gly Tyr Gly Phe Asp Pr - #o His Glu Gln Thr Ile     #       155     - AAC CTG ACG ATC CGC GAC AGC GTG GCC CAC GA - #C AAC AGC CTC GAC GGC     2748     Asn Leu Thr Ile Arg Asp Ser Val Ala His As - #p Asn Ser Leu Asp Gly     #   170     - TTC GTC GCC GAC TAC CAG GTC GGC GGG GTG TT - #C GAG AAC AAC GTC TCG     2796     Phe Val Ala Asp Tyr Gln Val Gly Gly Val Ph - #e Glu Asn Asn Val Ser     175                 1 - #80                 1 - #85                 1 -     #90     - TAC AAC AAC GAC CGC CAC GGC TTC AAC ATC GT - #C ACC AGC ACC AAC GAC     2844     Tyr Asn Asn Asp Arg His Gly Phe Asn Ile Va - #l Thr Ser Thr Asn Asp     #               205     - TTC GTC CTG AGC AAC AAC GTC GCC TAC GGC AA - #C GGC GGC GCC GGC CTG     2892     Phe Val Leu Ser Asn Asn Val Ala Tyr Gly As - #n Gly Gly Ala Gly Leu     #           220     - GTG GTG CAG CGC GGC TCG TAC GAC CTG CCC CA - #T CCC TAC GAC ATC CTG     2940     Val Val Gln Arg Gly Ser Tyr Asp Leu Pro Hi - #s Pro Tyr Asp Ile Leu     #       235     - ATC GAC GGC GGC GCC TAC TAC GAC AAC GCC TT - #G GAA GGC GTG CAG CTC     2988     Ile Asp Gly Gly Ala Tyr Tyr Asp Asn Ala Le - #u Glu Gly Val Gln Leu     #   250     - AAG ATG GCC CAC GAC GTC ACC CTG CAG AAC GC - #C GAG ATC TAC GGC AAC     3036     Lys Met Ala His Asp Val Thr Leu Gln Asn Al - #a Glu Ile Tyr Gly Asn     255                 2 - #60                 2 - #65                 2 -     #70     - GGC CTG TAC GGG GTG CGC GTC TAC GGC GCC CA - #G GAC GTG CAG ATC CTC     3084     Gly Leu Tyr Gly Val Arg Val Tyr Gly Ala Gl - #n Asp Val Gln Ile Leu     #               285     - GAC AAC CAG ATC CAC GAC AAT TCG CAG AAC GG - #C GCC TAT GCC GAA GTC     3132     Asp Asn Gln Ile His Asp Asn Ser Gln Asn Gl - #y Ala Tyr Ala Glu Val     #           300     - CTG CTG CAG TCC TAC GAC GAC ACC GCC GGG GT - #G TCC GGC AAC TTT TAC     3180     Leu Leu Gln Ser Tyr Asp Asp Thr Ala Gly Va - #l Ser Gly Asn Phe Tyr     #       315     - GTC ACC ACC GGC ACC TGG CTC GAA GGC AAC GT - #C ATC AGC GGC TCG GCC     3228     Val Thr Thr Gly Thr Trp Leu Glu Gly Asn Va - #l Ile Ser Gly Ser Ala     #   330     - AAT TCC ACC TAC GGC ATC CAG GAG CGC GCC GA - #C GGC ACC GAC TAC AGC     3276     Asn Ser Thr Tyr Gly Ile Gln Glu Arg Ala As - #p Gly Thr Asp Tyr Ser     335                 3 - #40                 3 - #45                 3 -     #50     - AGC CTC TAC GCC AAC AGC ATC GAC GGT GTG CA - #G ACC GGG GCG GTA CGG     3324     Ser Leu Tyr Ala Asn Ser Ile Asp Gly Val Gl - #n Thr Gly Ala Val Arg     #               365     - CTG TAT GGC GCC AAC TCG ACG GTT TCC AGC CA - #G TCC GGC AGT GGC CAG     3372     Leu Tyr Gly Ala Asn Ser Thr Val Ser Ser Gl - #n Ser Gly Ser Gly Gln     #           380     - CAG GCG ACC CTC GAA GGC AGC GCG GGC AAC GA - #T GCG CTG AGC GGG ACC     3420     Gln Ala Thr Leu Glu Gly Ser Ala Gly Asn As - #p Ala Leu Ser Gly Thr     #       395     - GAG GCC CAC GAG ACG CTG CTC GGC CAG GCC GG - #C GAC GAC CGC CTG AAC     3468     Glu Ala His Glu Thr Leu Leu Gly Gln Ala Gl - #y Asp Asp Arg Leu Asn     #   410     - GGC GAT GCC GGC AAC GAC ATC CTC GAC GGC GG - #G GCA GGG CGC GAC AAC     3516     Gly Asp Ala Gly Asn Asp Ile Leu Asp Gly Gl - #y Ala Gly Arg Asp Asn     415                 4 - #20                 4 - #25                 4 -     #30     - CTG ACC GGC GGC GCG GGC GCC GAC ACC TTC CG - #C TTC TCC GCG CGC ACC     3564     Leu Thr Gly Gly Ala Gly Ala Asp Thr Phe Ar - #g Phe Ser Ala Arg Thr     #               445     - GAC AGC TAC CGC ACC GAC AGC GCC AGC TTC AA - #C GAC CTG ATC ACC GAC     3612     Asp Ser Tyr Arg Thr Asp Ser Ala Ser Phe As - #n Asp Leu Ile Thr Asp     #           460     - TTC GAC GCC GAC GAG GAC AGC ATC GAC CTG TC - #C GCG CTG GGC TTC ACC     3660     Phe Asp Ala Asp Glu Asp Ser Ile Asp Leu Se - #r Ala Leu Gly Phe Thr     #       475     - GGC CTG GGC GAC GGC TAC AAT GGC ACC CTG CT - #G CTG AAG ACC AAC GCC     3708     Gly Leu Gly Asp Gly Tyr Asn Gly Thr Leu Le - #u Leu Lys Thr Asn Ala     #   490     - GAG GGT ACG CGC ACC TAC CTG AAG AGC TAC GA - #A GCG GAC GCC CAG GGC     3756     Glu Gly Thr Arg Thr Tyr Leu Lys Ser Tyr Gl - #u Ala Asp Ala Gln Gly     495                 5 - #00                 5 - #05                 5 -     #10     - CGG CGC TTC GAG ATC GCC CTG GAC GGC AAC TT - #C ACC GGT CTG TTC AAC     3804     Arg Arg Phe Glu Ile Ala Leu Asp Gly Asn Ph - #e Thr Gly Leu Phe Asn     #               525     - GAC AAC AAC CTG TTG TTC GAC GCC GCT CCG GC - #C ACC GGT ACC GAG GGC     3852     Asp Asn Asn Leu Leu Phe Asp Ala Ala Pro Al - #a Thr Gly Thr Glu Gly     #           540     - AGC GAC AAC CTG CTC GGC ACC GAC GCC GGG GA - #A ACC CTC CTG GGC TAC     3900     Ser Asp Asn Leu Leu Gly Thr Asp Ala Gly Gl - #u Thr Leu Leu Gly Tyr     #       555     - GGC GGC AAC GAC ACC CTC AAC GGC GGG GCC GG - #C GAC GAC ATC CTG GTC     3948     Gly Gly Asn Asp Thr Leu Asn Gly Gly Ala Gl - #y Asp Asp Ile Leu Val     #   570     - GGC GGC GCC GGG CGC GAC AGC CTG ACC GGC GG - #C GCC GGG GCG GAC GTG     3996     Gly Gly Ala Gly Arg Asp Ser Leu Thr Gly Gl - #y Ala Gly Ala Asp Val     575                 5 - #80                 5 - #85                 5 -     #90     - TTC CGC TTC GAC GCG CTG TCC GAC AGC CAG CG - #C AAC TAC ACC ACC GGC     4044     Phe Arg Phe Asp Ala Leu Ser Asp Ser Gln Ar - #g Asn Tyr Thr Thr Gly     #               605     - GAC AAC CAG GCC GAC CGC ATT CTC GAC TTC GA - #C CCG ACC CTG GAC AGG     4092     Asp Asn Gln Ala Asp Arg Ile Leu Asp Phe As - #p Pro Thr Leu Asp Arg     #           620     - ATC GAC GTG TCG GCG CTG GGC TTC ACC GGG CT - #G GGC AAC GGC CGC AAC     4140     Ile Asp Val Ser Ala Leu Gly Phe Thr Gly Le - #u Gly Asn Gly Arg Asn     #       635     - GGC ACC CTC GCC GTG GTG CTC AAC AGC GCC GG - #C GAC CGC ACC GAT CTG     4188     Gly Thr Leu Ala Val Val Leu Asn Ser Ala Gl - #y Asp Arg Thr Asp Leu     #   650     - AAG AGC TAC GAC ACC GAC GCC AAC GGC TAC AG - #C TTC GAG CTT TCC CTC     4236     Lys Ser Tyr Asp Thr Asp Ala Asn Gly Tyr Se - #r Phe Glu Leu Ser Leu     655                 6 - #60                 6 - #65                 6 -     #70     - GCG GGC AAC TAC CAG GGG CAG CTC AGC GCC GA - #G CAG TTC GTT TTC GCG     4284     Ala Gly Asn Tyr Gln Gly Gln Leu Ser Ala Gl - #u Gln Phe Val Phe Ala     #               685     - ACG TCT CAG GGG GGA CAG ATG ACG ATT ATC GA - #A GGC ACC GAC GGC AAC     4332     Thr Ser Gln Gly Gly Gln Met Thr Ile Ile Gl - #u Gly Thr Asp Gly Asn     #           700     - GAT ACC TTG CAG GGC ACC GAG GCC AAC GAG CG - #G CTC CTC GGC CTG GAC     4380     Asp Thr Leu Gln Gly Thr Glu Ala Asn Glu Ar - #g Leu Leu Gly Leu Asp     #       715     - GGC CGG GAC AAC CTG AAC GGC GGC GCC GGC GA - #C GAC ATC CTC GAC GGC     4428     Gly Arg Asp Asn Leu Asn Gly Gly Ala Gly As - #p Asp Ile Leu Asp Gly     #   730     - GGA GCG GGG CGC GAC ACC CTG ACC GGC GGC AC - #G GGG GCC GAC ACC TTC     4476     Gly Ala Gly Arg Asp Thr Leu Thr Gly Gly Th - #r Gly Ala Asp Thr Phe     735                 7 - #40                 7 - #45                 7 -     #50     - CTG TTC TCC ACG CGT ACC GAC AGC TAC CGC AC - #C GAC AGC GCC AGC TTC     4524     Leu Phe Ser Thr Arg Thr Asp Ser Tyr Arg Th - #r Asp Ser Ala Ser Phe     #               765     - AAC GAC CTG ATC ACC GAC TTC GAT CCC ACC CA - #G GAC CGC ATC GAC CTG     4572     Asn Asp Leu Ile Thr Asp Phe Asp Pro Thr Gl - #n Asp Arg Ile Asp Leu     #           780     - TCC GGC CTG GGC TTC AGC GGT TTC GGC AAC GG - #C TAC GAC GGC ACC CTG     4620     Ser Gly Leu Gly Phe Ser Gly Phe Gly Asn Gl - #y Tyr Asp Gly Thr Leu     #       795     - CTG CTG CAG GTC AAC GCC GCG GGC ACC CGC AC - #C TAC CTG AAG AGT TTC     4668     Leu Leu Gln Val Asn Ala Ala Gly Thr Arg Th - #r Tyr Leu Lys Ser Phe     #   810     - GAG GCC GAT GCC AAC GGC CAG CGC TTC GAG AT - #C GCC CTG GAC GGC GAC     4716     Glu Ala Asp Ala Asn Gly Gln Arg Phe Glu Il - #e Ala Leu Asp Gly Asp     815                 8 - #20                 8 - #25                 8 -     #30     - TTC AGC GGC CAA TTG GAC AGC GGC AAC GTG AT - #C TTC GAG CCC GCC GTG     4764     Phe Ser Gly Gln Leu Asp Ser Gly Asn Val Il - #e Phe Glu Pro Ala Val     #               845     - TTC AAT GCC AAG GAC TTC GGC GCG CTG GGC GA - #C GGC GCC AGC GAC GAC     4812     Phe Asn Ala Lys Asp Phe Gly Ala Leu Gly As - #p Gly Ala Ser Asp Asp     #           860     - CGG CCG GCC ATC CAG GCG GCG ATC GAC GCC GC - #C TAC GCG GCC GGT GGC     4860     Arg Pro Ala Ile Gln Ala Ala Ile Asp Ala Al - #a Tyr Ala Ala Gly Gly     #       875     - GGC ACC GTC TAC CTG CCG GCC GGC GAG TAC CG - #G GTC AGC CCC ACC GGG     4908     Gly Thr Val Tyr Leu Pro Ala Gly Glu Tyr Ar - #g Val Ser Pro Thr Gly     #   890     - GAG CCG GGC GAC GGC TGC CTG ATG CTC AAG GA - #C GGC GTC TAC CTG GCC     4956     Glu Pro Gly Asp Gly Cys Leu Met Leu Lys As - #p Gly Val Tyr Leu Ala     895                 9 - #00                 9 - #05                 9 -     #10     - GGC GAC GGC ATA GGC GAA ACG GTC ATC AAG CT - #G ATC GAC GGC TCC GAC     5004     Gly Asp Gly Ile Gly Glu Thr Val Ile Lys Le - #u Ile Asp Gly Ser Asp     #               925     - CAG AAG ATC ACC GGC ATG GTG CGC TCG GCC TA - #T GGC GAA GAG ACC AGC     5052     Gln Lys Ile Thr Gly Met Val Arg Ser Ala Ty - #r Gly Glu Glu Thr Ser     #           940     - AAC TTC GGC ATG AGC GAC CTG ACC CTC GAC GG - #C AAC CGC GAC AAC ACC     5100     Asn Phe Gly Met Ser Asp Leu Thr Leu Asp Gl - #y Asn Arg Asp Asn Thr     #       955     - AGC GGC AAG GTC GAC GGC TGG TTC AAC GGC TA - #C ATC CCC GGC CAG GAC     5148     Ser Gly Lys Val Asp Gly Trp Phe Asn Gly Ty - #r Ile Pro Gly Gln Asp     #   970     - GGC GCC GAC CGC AAC GTG ACC ATC GAG CGG GT - #G GAA ATC CGC GAG ATG     5196     Gly Ala Asp Arg Asn Val Thr Ile Glu Arg Va - #l Glu Ile Arg Glu Met     975                 9 - #80                 9 - #85                 9 -     #90     - TCC GGC TAT GGC TTC GAT CCG CAC GAG CAG AC - #C ATC AAC CTG ACG ATC     5244     Ser Gly Tyr Gly Phe Asp Pro His Glu Gln Th - #r Ile Asn Leu Thr Ile     #              10050     - CGC GAC AGC GTG GCC CAC GAC AAC GGC CTC GA - #C GGC TTC GTC GCC GAC     5292     Arg Asp Ser Val Ala His Asp Asn Gly Leu As - #p Gly Phe Val Ala Asp     #          10205     - TAC CTG GTC GAC AGC GTG TTC GAG AAC AAC GT - #C GCC TAC AAC AAC GAC     5340     Tyr Leu Val Asp Ser Val Phe Glu Asn Asn Va - #l Ala Tyr Asn Asn Asp     #      10350     - CGC CAC GGC TTC AAC ATC GTC ACC AGC ACC TA - #C GAT TTC GTC ATG ACC     5388     Arg His Gly Phe Asn Ile Val Thr Ser Thr Ty - #r Asp Phe Val Met Thr     #  10505     - AAC AAC GTC GCC TAC GGC AAC GGC GGC GCC GG - #C CTG ACG ATC CAG CGG     5436     Asn Asn Val Ala Tyr Gly Asn Gly Gly Ala Gl - #y Leu Thr Ile Gln Arg     #               10701060 - #                1065     - GGC TCG GAG GAC CTG GCC CAG CCG ACC GAT AT - #C CTG ATC GAC GGC GGC     5484     Gly Ser Glu Asp Leu Ala Gln Pro Thr Asp Il - #e Leu Ile Asp Gly Gly     #              10850     - GCC TAC TAC GAC AAC GCC CTG GAA GGC GTG CT - #G TTC AAG ATG ACC AAC     5532     Ala Tyr Tyr Asp Asn Ala Leu Glu Gly Val Le - #u Phe Lys Met Thr Asn     #          11005     - AAC GTC ACC CTG CAG AAC GCC GAG ATC TAC GG - #C AAC GGC TCC TCC GGC     5580     Asn Val Thr Leu Gln Asn Ala Glu Ile Tyr Gl - #y Asn Gly Ser Ser Gly     #      11150     - GTG CGC CTG TAC GGC ACG GAG GAC GTG CAG AT - #C CTC GAC AAC CAG ATC     5628     Val Arg Leu Tyr Gly Thr Glu Asp Val Gln Il - #e Leu Asp Asn Gln Ile     #  11305     - CAC GAC AAT TCG CAG AAC GGC ACC TAT CCG GA - #A GTC CTG CTG CAG GCC     5676     His Asp Asn Ser Gln Asn Gly Thr Tyr Pro Gl - #u Val Leu Leu Gln Ala     #               11501140 - #                1145     - TTC GAC GAC AGC CAG GTC ACC GGT GAG CTG TA - #C GAG ACC CTG AAC ACC     5724     Phe Asp Asp Ser Gln Val Thr Gly Glu Leu Ty - #r Glu Thr Leu Asn Thr     #              11650     - CGG ATC GAA GGC AAT CTC ATC GAC GCT TCG GA - #C AAC GCC AAC TAT GCG     5772     Arg Ile Glu Gly Asn Leu Ile Asp Ala Ser As - #p Asn Ala Asn Tyr Ala     #          11805     - GTG CGC GAG CGC GAC GAC GGC AGC GAC TAC AC - #C ACG CTC GTG GAC AAC     5820     Val Arg Glu Arg Asp Asp Gly Ser Asp Tyr Th - #r Thr Leu Val Asp Asn     #      11950     - GAC ATC AGC GGC GGC CAG GTC GCC TCG GTG CA - #G CTT TCC GGC GCC CAT     5868     Asp Ile Ser Gly Gly Gln Val Ala Ser Val Gl - #n Leu Ser Gly Ala His     #  12105     - TCG AGT CTT TCC GGC GGC ACC GTC GAA GTG CC - #G CAG GGG ACC GAC GGC     5916     Ser Ser Leu Ser Gly Gly Thr Val Glu Val Pr - #o Gln Gly Thr Asp Gly     #               12301220 - #                1225     - AAC GAC GTG CTG GTC GGC AGC GAT GCC AAC GA - #C CAG CTC TAC GGC GGA     5964     Asn Asp Val Leu Val Gly Ser Asp Ala Asn As - #p Gln Leu Tyr Gly Gly     #              12450     - GCC GGC GAC GAC CGC CTG GAC GGC GGC GCC GG - #T GAC GAC CTG CTC GAC     6012     Ala Gly Asp Asp Arg Leu Asp Gly Gly Ala Gl - #y Asp Asp Leu Leu Asp     #          12605     - GGC GGA GCG GGG CGC GAC GAC CTG ACC GGC GG - #C ACG GGT GCC GAC ACC     6060     Gly Gly Ala Gly Arg Asp Asp Leu Thr Gly Gl - #y Thr Gly Ala Asp Thr     #      12750     - TTC GTG TTC GCC GCG CGT ACC GAT AGC TAC CG - #C ACC GAC GCG GGG GTG     6108     Phe Val Phe Ala Ala Arg Thr Asp Ser Tyr Ar - #g Thr Asp Ala Gly Val     #  12905     - TTC AAC GAC CTG ATC CTC GAC TTC GAC GCC AG - #C GAG GAC CGC ATC GAC     6156     Phe Asn Asp Leu Ile Leu Asp Phe Asp Ala Se - #r Glu Asp Arg Ile Asp     #               13101300 - #                1305     - CTG TCC GCC CTG GGT TTC AGC GGC TTC GGC GA - #C GGC TAC AAC GGC ACC     6204     Leu Ser Ala Leu Gly Phe Ser Gly Phe Gly As - #p Gly Tyr Asn Gly Thr     #              13250     - CTG CTG GTG CAG CTC AGC AGC GCC GGA ACC CG - #T ACC TAC CTC AAG AGC     6252     Leu Leu Val Gln Leu Ser Ser Ala Gly Thr Ar - #g Thr Tyr Leu Lys Ser     #          13405     - TAC GAG GAG GAC CTC GAG GGC CGG CGC TTC GA - #G GTC GCC CTG GAC GGC     6300     Tyr Glu Glu Asp Leu Glu Gly Arg Arg Phe Gl - #u Val Ala Leu Asp Gly     #      13550     - GAC CAC ACG GGC GAT CTT TCC GCC GCC AAT GT - #G GTT TTC GCC GAC GAC     6348     Asp His Thr Gly Asp Leu Ser Ala Ala Asn Va - #l Val Phe Ala Asp Asp     #  13705     - GGC TCG GCC GCC GTG GCG AGC AGC GAT CCC GC - #C GCC ACA CAG TTG GAG     6396     Gly Ser Ala Ala Val Ala Ser Ser Asp Pro Al - #a Ala Thr Gln Leu Glu     #               13901380 - #                1385     - GTG GTC GGC AGC AGC GGC ACC CAG ACC GAT CA - #A CTC GCC TGA     #6438     Val Val Gly Ser Ser Gly Thr Gln Thr Asp Gl - #n Leu Ala  *     #               1400     - TCCGACCCCG CCCATACCCG CCCGGCCATT CCGGCCGGGC GAACCAATGG TC - #TTCAGGCC     6498     - AGTCTCAGGC ACAGCAGCGC GCGAGCCGCT TCGCTTTGTC CGCCCCCCGC TT - #TTCTCGCT     6558     - GAACGCGACG ATCGCCGGGC GCCGGGGAAG GGTTCGCCGC ATGCCGAGCC GG - #GGACGGGA     6618     - AAAGCCTGTT CGACCAGTCG ACTCTTCCTC CCTTCACTTT CCAGGCAGCC TG - #CGGGCTGC     6678     - GCAGTAACGG AACAGGAAGC AGC ATG GAT TAC AAC GTC AA - #A GAT TTC GGG     6728     #Asp Phe Gly Asp Tyr Asn Val Lys     #       5  1     - GCG CTG GGC GAT GGC GTC AGC GAC GAT ACG GC - #C GCC ATC CAG GCG GCG     6776     Ala Leu Gly Asp Gly Val Ser Asp Asp Thr Al - #a Ala Ile Gln Ala Ala     # 25     - ATC GAT GCC GCC TAC GCG GCC GGC GGC GGC AC - #C GTC TAC CTG CCG GCC     6824     Ile Asp Ala Ala Tyr Ala Ala Gly Gly Gly Th - #r Val Tyr Leu Pro Ala     #                 40     - GGC GAA TAC CGG GTC AGC GGC GGC GAG GAG CC - #T TCC GAT GGT TGC CTG     6872     Gly Glu Tyr Arg Val Ser Gly Gly Glu Glu Pr - #o Ser Asp Gly Cys Leu     #             55     - ACC ATC AAG AGC AAC GTC CAT ATC GTC GGC GC - #G GGG ATG GGC GAG ACG     6920     Thr Ile Lys Ser Asn Val His Ile Val Gly Al - #a Gly Met Gly Glu Thr     #         70     - GTC ATC AAG CTG GTC GAC GGC TGG GAT CAG GA - #C GTC ACC GGC ATC GTC     6968     Val Ile Lys Leu Val Asp Gly Trp Asp Gln As - #p Val Thr Gly Ile Val     #     85     - CGC TCG GCC TAC GGC GAG GAG ACC AGC AAC TT - #C GGC ATG AGC GAC CTG     7016     Arg Ser Ala Tyr Gly Glu Glu Thr Ser Asn Ph - #e Gly Met Ser Asp Leu     #105     - ACC CTC GAC GGC AAC CGC GAC AAC ACC AGC GG - #C AAG GTC GAC GGC TGG     7064     Thr Leu Asp Gly Asn Arg Asp Asn Thr Ser Gl - #y Lys Val Asp Gly Trp     #               120     - TTC AAC GGC TAC ATT CCC GGC GAG GAC GGC GC - #C GAC CGC GAC GTG ACC     7112     Phe Asn Gly Tyr Ile Pro Gly Glu Asp Gly Al - #a Asp Arg Asp Val Thr     #           135     - CTG GAG CGG GTG GAA ATC CGT GAA ATG TCC GG - #T TAC GGT TTC GAT CCG     7160     Leu Glu Arg Val Glu Ile Arg Glu Met Ser Gl - #y Tyr Gly Phe Asp Pro     #       150     - CAC GAG CAG ACC ATC AAC CTG ACG ATC CGC GA - #C AGC GTG GCC CAC GAC     7208     His Glu Gln Thr Ile Asn Leu Thr Ile Arg As - #p Ser Val Ala His Asp     #   165     - AAC GGC CTC GAC GGC TTC GTC GCC GAT TTC CA - #G ATC GGC GGG GTG TTC     7256     Asn Gly Leu Asp Gly Phe Val Ala Asp Phe Gl - #n Ile Gly Gly Val Phe     170                 1 - #75                 1 - #80                 1 -     #85     - GAG AAC AAC GTC TCG TAC AAC AAC GAC CGC CA - #C GGC TTC AAC ATC GTC     7304     Glu Asn Asn Val Ser Tyr Asn Asn Asp Arg Hi - #s Gly Phe Asn Ile Val     #               200     - ACC AGC ACC AAC GAC TTC GTC CTG AGC AAC AA - #C GTC GCC TAC GGC AAC     7352     Thr Ser Thr Asn Asp Phe Val Leu Ser Asn As - #n Val Ala Tyr Gly Asn     #           215     - GGC GGC GCC GGC CTG GTG GTG CAG CGC GGC TC - #G TCC GAC GTG GCG CAC     7400     Gly Gly Ala Gly Leu Val Val Gln Arg Gly Se - #r Ser Asp Val Ala His     #       230     - CCC TAC GAC ATC CTG ATC GAC GGC GGC GCC TA - #C TAC GAC AAC GGC CTG     7448     Pro Tyr Asp Ile Leu Ile Asp Gly Gly Ala Ty - #r Tyr Asp Asn Gly Leu     #   245     - GAA GGC GTG CAG ATC AAG ATG GCC CAC GAC GT - #C ACC CTG CAG AAC GCC     7496     Glu Gly Val Gln Ile Lys Met Ala His Asp Va - #l Thr Leu Gln Asn Ala     250                 2 - #55                 2 - #60                 2 -     #65     - GAG ATC TAC GGC AAC GGC CTA TAC GGG GTG CG - #C GTC TAC GGC GCC GAG     7544     Glu Ile Tyr Gly Asn Gly Leu Tyr Gly Val Ar - #g Val Tyr Gly Ala Glu     #               280     - GAT GTG CAG ATC CTC GAC AAC TAC ATC CAC GA - #C AAT TCG CAG AAC GGT     7592     Asp Val Gln Ile Leu Asp Asn Tyr Ile His As - #p Asn Ser Gln Asn Gly     #           295     - TCC TAC GCG GAA ATC CTC CTG CAG TCC TAC GA - #C GAT ACC GCC GGG GTG     7640     Ser Tyr Ala Glu Ile Leu Leu Gln Ser Tyr As - #p Asp Thr Ala Gly Val     #       310     - TCC GGC AAT TTC TAC ACC ACC ACC GGC ACC TG - #G ATC GAA GGC AAC ACC     7688     Ser Gly Asn Phe Tyr Thr Thr Thr Gly Thr Tr - #p Ile Glu Gly Asn Thr     #   325     - ATC GTC GGC TCG GCC AAC TCC ACC TAT GGC AT - #C CAG GAG CGC GAC GAC     7736     Ile Val Gly Ser Ala Asn Ser Thr Tyr Gly Il - #e Gln Glu Arg Asp Asp     330                 3 - #35                 3 - #40                 3 -     #45     - GGC ACC GAC TAC AGC AGC CTC TAC GCC AAC AG - #C GTC AGC AAT GTG CAG     7784     Gly Thr Asp Tyr Ser Ser Leu Tyr Ala Asn Se - #r Val Ser Asn Val Gln     #               360     - AAC GGC TCG GTG CGC CTC TAC GGC GCC AAC TC - #C GTC GTC TCC GAC CTG     7832     Asn Gly Ser Val Arg Leu Tyr Gly Ala Asn Se - #r Val Val Ser Asp Leu     #           375     - CCC GGC ACC GGC CAG CAG GCG ACC CTC GAA GG - #C ACG GCC GGC AAC GAC     7880     Pro Gly Thr Gly Gln Gln Ala Thr Leu Glu Gl - #y Thr Ala Gly Asn Asp     #       390     - ACG CTT GGC GGC AGC GAC GCC CAC GAG ACG CT - #G CTC GGG CTG GAC GGC     7928     Thr Leu Gly Gly Ser Asp Ala His Glu Thr Le - #u Leu Gly Leu Asp Gly     #   405     - AAC GAC CGC CTG AAC GGC GGC GCC GGC AAC GA - #C ATC CTC GAC GGC GGC     7976     Asn Asp Arg Leu Asn Gly Gly Ala Gly Asn As - #p Ile Leu Asp Gly Gly     410                 4 - #15                 4 - #20                 4 -     #25     - GCC GGG CGC GAC AAC CTG ACC GGC GGC GCG GG - #C GCC GAC CTG TTC CGC     8024     Ala Gly Arg Asp Asn Leu Thr Gly Gly Ala Gl - #y Ala Asp Leu Phe Arg     #               440     - GTC TCC GCG CGC ACC GAC AGC TAC CGC ACC GA - #C AGC GCC AGC TTC AAC     8072     Val Ser Ala Arg Thr Asp Ser Tyr Arg Thr As - #p Ser Ala Ser Phe Asn     #           455     - GAC CTG ATC ACC GAC TTC GAC GCC AGC CAG GA - #C CGC ATC GAC CTG TCC     8120     Asp Leu Ile Thr Asp Phe Asp Ala Ser Gln As - #p Arg Ile Asp Leu Ser     #       470     - GCG CTG GGC TTC ACC GGG CTG GGC GAC GGC TA - #T AAC GGC ACC CTG CTG     8168     Ala Leu Gly Phe Thr Gly Leu Gly Asp Gly Ty - #r Asn Gly Thr Leu Leu     #   485     - CTG CAG GTC AGC GCC GAC GGC AGC CGC ACC TA - #T CTG AAG AGC CTG GAG     8216     Leu Gln Val Ser Ala Asp Gly Ser Arg Thr Ty - #r Leu Lys Ser Leu Glu     490                 4 - #95                 5 - #00                 5 -     #05     - GCG GAT GCC GAG GGG CGG CGT TTC GAG ATC GC - #C CTG GAC GGC AAC TTC     8264     Ala Asp Ala Glu Gly Arg Arg Phe Glu Ile Al - #a Leu Asp Gly Asn Phe     #               520     - GCC GGC CTG CTC GGT GCC GGC AAC CTG CTC TT - #C GAG CGC ACC GCC ATC     8312     Ala Gly Leu Leu Gly Ala Gly Asn Leu Leu Ph - #e Glu Arg Thr Ala Ile     #           535     - GAG GGG GAT GCC GGC GAC AAC GCC CTG CTC GG - #T ACC TCG GCC GCC GAG     8360     Glu Gly Asp Ala Gly Asp Asn Ala Leu Leu Gl - #y Thr Ser Ala Ala Glu     #       550     - ACA TTG CTC GGC CAC GCC GGC AAC GAC ACG CT - #C GAC GGC GGG GCC GGC     8408     Thr Leu Leu Gly His Ala Gly Asn Asp Thr Le - #u Asp Gly Gly Ala Gly     #   565     - GAC GAC ATC CTG GTC GGC GGC GCC GGG CGC GA - #C AGC CTC ACC GGC GGC     8456     Asp Asp Ile Leu Val Gly Gly Ala Gly Arg As - #p Ser Leu Thr Gly Gly     570                 5 - #75                 5 - #80                 5 -     #85     - GCC GGA GCG GAC GTG TTC CGC TTC GAC GCG CT - #G TCC GAC AGC CAG CGC     8504     Ala Gly Ala Asp Val Phe Arg Phe Asp Ala Le - #u Ser Asp Ser Gln Arg     #               600     - AAC TAC GAC ATC GGC GAC AAC CAG GGC GAC CG - #C ATC GCC GAC TTC GCG     8552     Asn Tyr Asp Ile Gly Asp Asn Gln Gly Asp Ar - #g Ile Ala Asp Phe Ala     #           615     - GTG GGC GAA GAC AAG CTC GAC GTA TCG GCG CT - #G GGC TTC ACC GGG CTG     8600     Val Gly Glu Asp Lys Leu Asp Val Ser Ala Le - #u Gly Phe Thr Gly Leu     #       630     - GGC GAC GGC TAC AAC GGC ACC CTC GCC CTG GT - #G CTC AAC AGC GCC GGC     8648     Gly Asp Gly Tyr Asn Gly Thr Leu Ala Leu Va - #l Leu Asn Ser Ala Gly     #   645     - GAC CGC ACC TAC GTG AAA AGC TAC GAG AAC GG - #C GCC GAC GGC TAC CGC     8696     Asp Arg Thr Tyr Val Lys Ser Tyr Glu Asn Gl - #y Ala Asp Gly Tyr Arg     650                 6 - #55                 6 - #60                 6 -     #65     - TTC GAG TTT TCC CTC GAC GGC AAC TAT CTG GA - #G CTA CTC GGC AAC GAG     8744     Phe Glu Phe Ser Leu Asp Gly Asn Tyr Leu Gl - #u Leu Leu Gly Asn Glu     #               680     - GAT TTC ATC TTC GCC ACG CCC AGC GGC CAG CA - #A CTC CTC GAA GGC AGC     8792     Asp Phe Ile Phe Ala Thr Pro Ser Gly Gln Gl - #n Leu Leu Glu Gly Ser     #           695     - GCC GGC AAC GAC AGC CTG CAG GGC ACG GCC GC - #C GAC GAG GTG ATC CAC     8840     Ala Gly Asn Asp Ser Leu Gln Gly Thr Ala Al - #a Asp Glu Val Ile His     #       710     - GGC GGC GGC GGG CGC GAC ACG CTG GCC GGA GG - #G GCC GGG GCC GAC GTG     8888     Gly Gly Gly Gly Arg Asp Thr Leu Ala Gly Gl - #y Ala Gly Ala Asp Val     #   725     - TTC CGC TTT AGC GAA CTG ACC GAC AGC TAC CG - #A GAC AGT GCC AGC TAT     8936     Phe Arg Phe Ser Glu Leu Thr Asp Ser Tyr Ar - #g Asp Ser Ala Ser Tyr     730                 7 - #35                 7 - #40                 7 -     #45     - GCC GAT CTG ATC ACT GAC TTC GAT GCC AGC GA - #G GAT CGT ATC GAC CTG     8984     Ala Asp Leu Ile Thr Asp Phe Asp Ala Ser Gl - #u Asp Arg Ile Asp Leu     #               760     - TCC GGC CTC GGC TTC AGC GGT CTG GGC AAC GG - #C TAC GGC GGT ACC CTG     9032     Ser Gly Leu Gly Phe Ser Gly Leu Gly Asn Gl - #y Tyr Gly Gly Thr Leu     #           775     - GCG CTG CAG GTG AAC AGC GCC GGT ACC CGC AC - #C TAC CTG AAG AGC TTC     9080     Ala Leu Gln Val Asn Ser Ala Gly Thr Arg Th - #r Tyr Leu Lys Ser Phe     #       790     - GAG ACC AAC GCC GCC GGC GAG CGT TTC GAG AT - #C GCC CTG GAC GGC GAC     9128     Glu Thr Asn Ala Ala Gly Glu Arg Phe Glu Il - #e Ala Leu Asp Gly Asp     #   805     - CTG TCC GCG CTC GGC GGG GCC AAC CTG ATC CT - #C GAC GCG CGT ACC GTA     9176     Leu Ser Ala Leu Gly Gly Ala Asn Leu Ile Le - #u Asp Ala Arg Thr Val     810                 8 - #15                 8 - #20                 8 -     #25     - CTG GCG GGC GGC GAC GGC AAC GAC ACG CTT TC - #C GGC AGC AGC GCG GCC     9224     Leu Ala Gly Gly Asp Gly Asn Asp Thr Leu Se - #r Gly Ser Ser Ala Ala     #               840     - GAG GAA CTG CTC GGC GGG GTC GGC AAC GAC AG - #C CTG GAC GGC GGC GCC     9272     Glu Glu Leu Leu Gly Gly Val Gly Asn Asp Se - #r Leu Asp Gly Gly Ala     #           855     - GGC AAC GAC ATC CTC GAC GGC GGG GCG GGG CG - #C GAC ACC CTG AGT GGC     9320     Gly Asn Asp Ile Leu Asp Gly Gly Ala Gly Ar - #g Asp Thr Leu Ser Gly     #       870     - GGC AGC GGC AGC GAC ATC TTC CGC TTC GGC GG - #C GCG CTC GAC AGC TTC     9368     Gly Ser Gly Ser Asp Ile Phe Arg Phe Gly Gl - #y Ala Leu Asp Ser Phe     #   885     - CGC AAC TAC GCC AGC GGG ACG AAC GGC ACC GA - #C AGC ATC ACC GAC TTC     9416     Arg Asn Tyr Ala Ser Gly Thr Asn Gly Thr As - #p Ser Ile Thr Asp Phe     890                 8 - #95                 9 - #00                 9 -     #05     - ACC CCC GGC GAG GAT CTG ATC GAC CTC TCC GT - #G CTC GGC TAC ACC GGG     9464     Thr Pro Gly Glu Asp Leu Ile Asp Leu Ser Va - #l Leu Gly Tyr Thr Gly     #               920     - CTG GGC GAC GGC TAC AAC GGT ACC CTG GCG AT - #A GTG CTG AAC GAC GCC     9512     Leu Gly Asp Gly Tyr Asn Gly Thr Leu Ala Il - #e Val Leu Asn Asp Ala     #           935     - GGC ACC AAG ACC TAC CTG AAA AAC CGC GAG AG - #C GAC GCC GAA GGC AAC     9560     Gly Thr Lys Thr Tyr Leu Lys Asn Arg Glu Se - #r Asp Ala Glu Gly Asn     #       950     - CAG TTC GAG ATC GCC CTG GAG GGC AAC CAC GC - #C GAC CAG CTC GAT GCG     9608     Gln Phe Glu Ile Ala Leu Glu Gly Asn His Al - #a Asp Gln Leu Asp Ala     #   965     - AGC GAC TTC ATC TTC GCC ACG GCG GCC GCG AC - #C ACC GGA ATC GAG GTG     9656     Ser Asp Phe Ile Phe Ala Thr Ala Ala Ala Th - #r Thr Gly Ile Glu Val     970                 9 - #75                 9 - #80                 9 -     #85     - GTC GGC GGC AGC GGC ACC CAG ACC GAT CAG CT - #C GCC TGA TCCGACCCCG     9705     Val Gly Gly Ser Gly Thr Gln Thr Asp Gln Le - #u Ala  *     #               995     - CCCGCACCCG CCCGGCCATT CCGGCCGGGC GAACCAATGG CCTTTTGATC AG - #TCTCAGGC     9765     - ACAGCAACGT GTGCGCCGCT TCGCTTGTTC GCCCTCCCGG CCTTGTTTCT CG - #CTGAAAGC     9825     - GACGATCGCC GGGGGCGTGC CGGGCGCGAG AAAAGGTTCG CCGTGTGCAA AG - #CCGGGGAC     9885     - GGGAAAAGCC TGTTCAAGTA GTCGACTCTT CCTTCTCCTT TTTCCTAGAC GG - #CCTCTTGG     9945     #AAA GAT TTC       9996 AAGCAGC ATG GAC TTC AAC GTC     #            Met Asp Phe As - #n Val Lys Asp Phe     #           5  1     - GGG GCA CTG GGC GAT GGC GCC AGC GAC GAC AC - #G GCG GCC ATC CAG GCG     10044     Gly Ala Leu Gly Asp Gly Ala Ser Asp Asp Th - #r Ala Ala Ile Gln Ala     #     20     - GCG ATC GAT GCC GCC CAC GCG GCG GGC GGC GG - #C ACC GTC TAC CTG CCG     10092     Ala Ile Asp Ala Ala His Ala Ala Gly Gly Gl - #y Thr Val Tyr Leu Pro     # 40     - GCT GGC GAG TAT CGG GTC AGC GGC GGC GAG GA - #G CCT TCC GAC GGC GCG     10140     Ala Gly Glu Tyr Arg Val Ser Gly Gly Glu Gl - #u Pro Ser Asp Gly Ala     #                 55     - CTG ACC ATC AAG AGC AAC GTC TAT ATC GTC GG - #C GCC GGG ATG GGC GAG     10188     Leu Thr Ile Lys Ser Asn Val Tyr Ile Val Gl - #y Ala Gly Met Gly Glu     #             70     - ACG GTG ATC AAG ATG GTC GAC GGC TGG ACG CA - #G AAC GTC ACC GGC ATG     10236     Thr Val Ile Lys Met Val Asp Gly Trp Thr Gl - #n Asn Val Thr Gly Met     #         85     - GTG CGC TCG GCC TAT GGC GAG GAG ACC AGC AA - #C TTC GGC ATG AGC GAC     10284     Val Arg Ser Ala Tyr Gly Glu Glu Thr Ser As - #n Phe Gly Met Ser Asp     #    100     - CTG ACC CTC GAC GGC AAC CGC GAC AAC CTG TC - #C GCC AAG GTC GAC GGC     10332     Leu Thr Leu Asp Gly Asn Arg Asp Asn Leu Se - #r Ala Lys Val Asp Gly     105                 1 - #10                 1 - #15                 1 -     #20     - TGG TTC AAC GGC TAC ATT CCC GGC CAG GAC GG - #T GCC GAT CGC GAC GTG     10380     Trp Phe Asn Gly Tyr Ile Pro Gly Gln Asp Gl - #y Ala Asp Arg Asp Val     #               135     - ACC CTG GAG CGG GTG GAA ATC CGC GAA ATG TC - #C GGT TAC GGT TTC GAT     10428     Thr Leu Glu Arg Val Glu Ile Arg Glu Met Se - #r Gly Tyr Gly Phe Asp     #           150     - CCG CAC GAG CAG ACC ATC AAC CTG ACG ATC CG - #C GAC AGC GTG GCC CAC     10476     Pro His Glu Gln Thr Ile Asn Leu Thr Ile Ar - #g Asp Ser Val Ala His     #       165     - GAC AAC GGC CTC GAC GGC TTC GTC GCC GAC TA - #C CAG GTC GGC GGG GTG     10524     Asp Asn Gly Leu Asp Gly Phe Val Ala Asp Ty - #r Gln Val Gly Gly Val     #   180     - TTC GAG AAC AAC GTC TCG TAC AAC AAC GAC CG - #C CAC GGC TTC AAC ATC     10572     Phe Glu Asn Asn Val Ser Tyr Asn Asn Asp Ar - #g His Gly Phe Asn Ile     185                 1 - #90                 1 - #95                 2 -     #00     - GTC ACC AGC ACC AAC GAC TTC GTC CTG AGC AA - #C AAC GTC GCC TAC GGC     10620     Val Thr Ser Thr Asn Asp Phe Val Leu Ser As - #n Asn Val Ala Tyr Gly     #               215     - AAC GGC GGC GCC GGC CTG GTG GTG CAG CGC GG - #C TCG TAC GAC CTG CCC     10668     Asn Gly Gly Ala Gly Leu Val Val Gln Arg Gl - #y Ser Tyr Asp Leu Pro     #           230     - CAT CCC TAC GAC ATC CTG ATC GAC GGC GGC GC - #C TAC TAC GAC AAC GCC     10716     His Pro Tyr Asp Ile Leu Ile Asp Gly Gly Al - #a Tyr Tyr Asp Asn Ala     #       245     - TTG GAA GGC GTG CAG CTC AAG ATG ACC CAC GA - #C GTC ACC CTG CAG AAC     10764     Leu Glu Gly Val Gln Leu Lys Met Thr His As - #p Val Thr Leu Gln Asn     #   260     - GCC GAG ATC TAT GGC AAC GGC CTG TAC GGG GT - #G CGC GTC TAC GGC GCC     10812     Ala Glu Ile Tyr Gly Asn Gly Leu Tyr Gly Va - #l Arg Val Tyr Gly Ala     265                 2 - #70                 2 - #75                 2 -     #80     - CAG GAC GTG CAA CTC CTC GAT AAC CAG ATC CA - #C GAC AAT TCG CAG AAC     10860     Gln Asp Val Gln Leu Leu Asp Asn Gln Ile Hi - #s Asp Asn Ser Gln Asn     #               295     - GGC GCC TAT GCC GAA GTC CTG CTG CAG TCC TA - #C GAC GAC ACC GCC GGG     10908     Gly Ala Tyr Ala Glu Val Leu Leu Gln Ser Ty - #r Asp Asp Thr Ala Gly     #           310     - GTG TCC GGC AAC TTT TAC GTC ACC ACC GGC AC - #C TGG CTC GAA GGC AAC     10956     Val Ser Gly Asn Phe Tyr Val Thr Thr Gly Th - #r Trp Leu Glu Gly Asn     #       325     - GTC ATC AGC GGC TCG GCC AAT TCC ACC TTC GG - #C ATC CAG GAG CGC GCC     11004     Val Ile Ser Gly Ser Ala Asn Ser Thr Phe Gl - #y Ile Gln Glu Arg Ala     #   340     - GAC GGC ACC GAC TAC AGC AGC CTT TAC GCC AA - #T ACC ATC GAC GGC GTG     11052     Asp Gly Thr Asp Tyr Ser Ser Leu Tyr Ala As - #n Thr Ile Asp Gly Val     345                 3 - #50                 3 - #55                 3 -     #60     - CAG AAC GGG ACG GTA CGG CTG TAT GGC GCC AA - #C TCC ACG GTT TCC GAG     11100     Gln Asn Gly Thr Val Arg Leu Tyr Gly Ala As - #n Ser Thr Val Ser Glu     #               375     - CAG CCC AGC AGC GGC CAG CAG GCG ACC CTC GA - #A GGC ACC GCG GGC AAC     11148     Gln Pro Ser Ser Gly Gln Gln Ala Thr Leu Gl - #u Gly Thr Ala Gly Asn     #           390     - GAC GTG CTC AGC GGA ACG GGT GCC CAC GAG CT - #G ATT CTC GGC CTG GCC     11196     Asp Val Leu Ser Gly Thr Gly Ala His Glu Le - #u Ile Leu Gly Leu Ala     #       405     - GGC AAC GAT CGC CTG GAC GGT GGC GCC GGC GA - #C GAC ACC CTC GAC GGC     11244     Gly Asn Asp Arg Leu Asp Gly Gly Ala Gly As - #p Asp Thr Leu Asp Gly     #   420     - GGC GCG GGG CGC GAT ACC CTG ACC GGC GGC GC - #G GGC GCC GAT ACC TTC     11292     Gly Ala Gly Arg Asp Thr Leu Thr Gly Gly Al - #a Gly Ala Asp Thr Phe     425                 4 - #30                 4 - #35                 4 -     #40     - CGC TTC TCT GCC CGC GAG GAC AGT CAC CGC AC - #C GAC AGC GCC AGC TTC     11340     Arg Phe Ser Ala Arg Glu Asp Ser His Arg Th - #r Asp Ser Ala Ser Phe     #               455     - ACC GAC CTG ATC ACC GAC TTC GAC GCC AGC CA - #G GAC CGC ATC GAC CTC     11388     Thr Asp Leu Ile Thr Asp Phe Asp Ala Ser Gl - #n Asp Arg Ile Asp Leu     #           470     - TCC GCG CTG GGC TTC ACC GGT CTG GGC AAC GG - #T TAT GAC GGC ACC CTG     11436     Ser Ala Leu Gly Phe Thr Gly Leu Gly Asn Gl - #y Tyr Asp Gly Thr Leu     #       485     - GCG GTG ACC ACC GGT TCC GGC GGC ACC CGC AC - #C TAC CTG AAG AGC TAC     11484     Ala Val Thr Thr Gly Ser Gly Gly Thr Arg Th - #r Tyr Leu Lys Ser Tyr     #   500     - GAG GTG GAC GCC CAG GGC CGG CGT TTC GAA AT - #C GCC CTG GAC GGC AAC     11532     Glu Val Asp Ala Gln Gly Arg Arg Phe Glu Il - #e Ala Leu Asp Gly Asn     505                 5 - #10                 5 - #15                 5 -     #20     - TTC GTC GGC CAG TTC AAC GAT GGC AAC CTG TT - #G TTC GAC GCC GCT CCG     11580     Phe Val Gly Gln Phe Asn Asp Gly Asn Leu Le - #u Phe Asp Ala Ala Pro     #               535     - GTC ACC GGT ACC GAG GGC AAC GAC AAC CTG TC - #C GGC ACC GAT GCC GGG     11628     Val Thr Gly Thr Glu Gly Asn Asp Asn Leu Se - #r Gly Thr Asp Ala Gly     #           550     - GAA ACC CTC CTG GGC TAC GGC GGC AAC GAC AC - #C CTC AAC GGC GGG GCC     11676     Glu Thr Leu Leu Gly Tyr Gly Gly Asn Asp Th - #r Leu Asn Gly Gly Ala     #       565     - GGC AAC GAC ATC CTG GTC GGC GGC GCC GGG CG - #C GAC ACC CTG ACC GGC     11724     Gly Asn Asp Ile Leu Val Gly Gly Ala Gly Ar - #g Asp Thr Leu Thr Gly     #   580     - GGC GCC GGG GCG GAC GTG TTC CGC TTC GAG GC - #G CTG TCC GAC AGC CAG     11772     Gly Ala Gly Ala Asp Val Phe Arg Phe Glu Al - #a Leu Ser Asp Ser Gln     585                 5 - #90                 5 - #95                 6 -     #00     - CGC AAC TAC ACC GCC GGC GAC AAC CAG GGC GA - #T TAC ATC ATC GAC TTC     11820     Arg Asn Tyr Thr Ala Gly Asp Asn Gln Gly As - #p Tyr Ile Ile Asp Phe     #               615     - GCC GTG GGC GAA GAC AGG ATC GAC GTA TCG GC - #G CTG GGT TAC ACC GGG     11868     Ala Val Gly Glu Asp Arg Ile Asp Val Ser Al - #a Leu Gly Tyr Thr Gly     #           630     - CTG GGC AAC GGC CGC AAC GGC ACC CTC GCC GT - #G GTG CTC AAC AGC GCC     11916     Leu Gly Asn Gly Arg Asn Gly Thr Leu Ala Va - #l Val Leu Asn Ser Ala     #       645     - GGC GAC CGC ACC TAC GTG AAG AGC TAC GAC AC - #T GAC GCC AAC GGC TAT     11964     Gly Asp Arg Thr Tyr Val Lys Ser Tyr Asp Th - #r Asp Ala Asn Gly Tyr     #   660     - AAC TTC GAG CTT TCC CTC GCG GGC AAC TAC CA - #G GGG CTG CTC GGC GCC     12012     Asn Phe Glu Leu Ser Leu Ala Gly Asn Tyr Gl - #n Gly Leu Leu Gly Ala     665                 6 - #70                 6 - #75                 6 -     #80     - GAA CAG TTC GTC TTC GCC ACG CCC CCG GAA CA - #G GCG ACC ATC GAG GGA     12060     Glu Gln Phe Val Phe Ala Thr Pro Pro Glu Gl - #n Ala Thr Ile Glu Gly     #               695     - ACC GAC GGC AAC GAC AGC TTG CAA GGG ACC GG - #G GCC GAC GAA CTG CTC     12108     Thr Asp Gly Asn Asp Ser Leu Gln Gly Thr Gl - #y Ala Asp Glu Leu Leu     #           710     - CTC GGT CTG GGC GGC CGG GAC AGC CTG AAC GG - #C GGC GCC GGC GAC GAT     12156     Leu Gly Leu Gly Gly Arg Asp Ser Leu Asn Gl - #y Gly Ala Gly Asp Asp     #       725     - GTC CTG GAT GGC GGG GCG GAG CGC GAC ACC CT - #G ACC GGC GGC ACG GGG     12204     Val Leu Asp Gly Gly Ala Glu Arg Asp Thr Le - #u Thr Gly Gly Thr Gly     #   740     - GCC GAC ACC TTC CTG TTC TCC GCG CGT ACC GA - #C AGC TAC CGC ACC GAC     12252     Ala Asp Thr Phe Leu Phe Ser Ala Arg Thr As - #p Ser Tyr Arg Thr Asp     745                 7 - #50                 7 - #55                 7 -     #60     - AGC GCC AGC TTC ACC GAC CTG ATC ACC GAC TT - #C GAT CCC GCC CAG GAT     12300     Ser Ala Ser Phe Thr Asp Leu Ile Thr Asp Ph - #e Asp Pro Ala Gln Asp     #               775     - CGC ATC GAC CTG TCC GGC CTG GGC TTC AGC GG - #T TTC GGC AAC GGC TAC     12348     Arg Ile Asp Leu Ser Gly Leu Gly Phe Ser Gl - #y Phe Gly Asn Gly Tyr     #           790     - GAC GGC ACC CTG CTG CTG CAG GTC AAC GCC GC - #G GGC ACC CGC ACC TAC     12396     Asp Gly Thr Leu Leu Leu Gln Val Asn Ala Al - #a Gly Thr Arg Thr Tyr     #       805     - CTG AAG AGC CTC GAG GCC GAT GCC GAC GGC CA - #G CGC TTC GAG ATC GCC     12444     Leu Lys Ser Leu Glu Ala Asp Ala Asp Gly Gl - #n Arg Phe Glu Ile Ala     #   820     - CTG GAC GGC GAC TTC AGC GGC CAG TTG GAC AG - #C GGC AAC GTG ATC TTC     12492     Leu Asp Gly Asp Phe Ser Gly Gln Leu Asp Se - #r Gly Asn Val Ile Phe     825                 8 - #30                 8 - #35                 8 -     #40     - GAG GCC GGC GTG TTC AAT GCC AAG GAC TTC GG - #C GCG CTG GGC GAC GGC     12540     Glu Ala Gly Val Phe Asn Ala Lys Asp Phe Gl - #y Ala Leu Gly Asp Gly     #               855     - GCC AGC GAC GAC CGG CCG GCC ATC CAG GCG GC - #G ATC GAC GCC GCC TAC     12588     Ala Ser Asp Asp Arg Pro Ala Ile Gln Ala Al - #a Ile Asp Ala Ala Tyr     #           870     - (2) INFORMATION FOR SEQ ID NO:2:     -      (i) SEQUENCE CHARACTERISTICS:               (A) LENGTH:  553 ami - #no acids               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:     - Met Asp Tyr Asn Val Lys Asp Phe Gly Ala Le - #u Gly Asp Gly Val Ser     #                 15     - Asp Asp Arg Ala Ser Ile Gln Ala Ala Ile As - #p Ala Ala Tyr Ala Ala     #             30     - Gly Gly Gly Thr Val Tyr Leu Pro Ala Gly Gl - #u Tyr Arg Val Ser Ala     #         45     - Ala Gly Glu Pro Gly Asp Gly Cys Leu Met Le - #u Lys Asp Gly Val Tyr     #     60     - Leu Ala Gly Ala Gly Met Gly Glu Thr Val Il - #e Lys Leu Ile Asp Gly     # 80     - Ser Asp Gln Lys Ile Thr Gly Met Val Arg Se - #r Ala Tyr Gly Glu Glu     #                 95     - Thr Ser Asn Phe Gly Met Arg Asp Leu Thr Le - #u Asp Gly Asn Arg Asp     #           110     - Asn Thr Ser Gly Lys Val Asp Gly Trp Phe As - #n Gly Tyr Ile Pro Gly     #       125     - Gly Asp Gly Ala Asp Arg Asp Val Thr Ile Gl - #u Arg Val Glu Val Arg     #   140     - Glu Met Ser Gly Tyr Gly Phe Asp Pro His Gl - #u Gln Thr Ile Asn Leu     145                 1 - #50                 1 - #55                 1 -     #60     - Thr Ile Arg Asp Ser Val Ala His Asp Asn Gl - #y Leu Asp Gly Phe Val     #               175     - Ala Asp Tyr Leu Val Asp Ser Val Phe Glu As - #n Asn Val Ala Tyr Ala     #           190     - Asn Asp Arg His Gly Phe Asn Val Val Thr Se - #r Thr His Asp Phe Val     #       205     - Met Thr Asn Asn Val Ala Tyr Gly Asn Gly Se - #r Ser Gly Leu Val Val     #   220     - Gln Arg Gly Leu Glu Asp Leu Ala Leu Pro Se - #r Asn Ile Leu Ile Asp     225                 2 - #30                 2 - #35                 2 -     #40     - Gly Gly Ala Tyr Tyr Asp Asn Ala Arg Glu Gl - #y Val Leu Leu Lys Met     #               255     - Thr Ser Asp Ile Thr Leu Gln Asn Ala Asp Il - #e His Gly Asn Gly Ser     #           270     - Ser Gly Val Arg Val Tyr Gly Ala Gln Asp Va - #l Gln Ile Leu Asp Asn     #       285     - Gln Ile His Asp Asn Ala Gln Ala Ala Ala Va - #l Pro Glu Val Leu Leu     #   300     - Gln Ser Phe Asp Asp Thr Ala Gly Ala Ser Gl - #y Thr Tyr Tyr Thr Thr     305                 3 - #10                 3 - #15                 3 -     #20     - Leu Asn Thr Arg Ile Glu Gly Asn Thr Ile Se - #r Gly Ser Ala Asn Ser     #               335     - Thr Tyr Gly Ile Gln Glu Arg Asn Asp Gly Th - #r Asp Tyr Ser Ser Leu     #           350     - Ile Asp Asn Asp Ile Ala Gly Val Gln Gln Pr - #o Ile Gln Leu Tyr Gly     #       365     - Pro His Ser Thr Val Ser Gly Glu Pro Gly Al - #a Thr Pro Gln Gln Pro     #   380     - Ser Thr Gly Ser Asp Gly Glu Pro Leu Val Gl - #y Gly Asp Thr Asp Asp     385                 3 - #90                 3 - #95                 4 -     #00     - Gln Leu Gln Gly Gly Ser Gly Ala Asp Arg Le - #u Asp Gly Gly Ala Gly     #               415     - Asp Asp Ile Leu Asp Gly Gly Ala Gly Arg As - #p Arg Leu Ser Gly Gly     #           430     - Ala Gly Ala Asp Thr Phe Val Phe Ser Ala Ar - #g Glu Asp Ser Tyr Arg     #       445     - Thr Asp Thr Ala Val Phe Asn Asp Leu Ile Le - #u Asp Phe Glu Ala Ser     #   460     - Glu Asp Arg Ile Asp Leu Ser Ala Leu Gly Ph - #e Ser Gly Leu Gly Asp     465                 4 - #70                 4 - #75                 4 -     #80     - Gly Tyr Gly Gly Thr Leu Leu Leu Lys Thr As - #n Ala Glu Gly Thr Arg     #               495     - Thr Tyr Leu Lys Ser Phe Glu Ala Asp Ala Gl - #u Gly Arg Arg Phe Glu     #           510     - Val Ala Leu Asp Gly Asp His Thr Gly Asp Le - #u Ser Ala Ala Asn Val     #       525     - Val Phe Ala Ala Thr Gly Thr Thr Thr Glu Le - #u Glu Val Leu Gly Asp     #   540     - Ser Gly Thr Gln Ala Gly Ala Ile Val     545                 5 - #50     - (2) INFORMATION FOR SEQ ID NO:3:     -      (i) SEQUENCE CHARACTERISTICS:               (A) LENGTH:  1403 am - #ino acids               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:     - Met Asp Tyr Asn Val Lys Asp Phe Gly Ala Le - #u Gly Asp Gly Val Ser     #                 15     - Asp Asp Thr Ala Ala Ile Gln Ala Ala Ile As - #p Ala Ala His Ala Ala     #             30     - Gly Gly Gly Thr Val Tyr Leu Pro Ala Gly Gl - #u Tyr Arg Val Ser Gly     #         45     - Gly Glu Glu Pro Ser Asp Gly Cys Leu Thr Il - #e Lys Ser Asn Val His     #     60     - Ile Val Gly Ala Gly Met Gly Glu Thr Val Il - #e Lys Met Val Asp Gly     # 80     - Trp Thr Gln Asn Val Thr Gly Met Val Arg Se - #r Ala Tyr Gly Glu Glu     #                 95     - Thr Ser Asn Phe Gly Met Ser Asp Leu Thr Le - #u Asp Gly Asn Arg Asp     #           110     - Asn Leu Ser Ala Lys Val Asp Gly Trp Phe As - #n Gly Tyr Ile Pro Gly     #       125     - Gln Asp Gly Ala Asp Arg Asp Val Thr Leu Gl - #u Arg Val Glu Ile Arg     #   140     - Glu Met Ser Gly Tyr Gly Phe Asp Pro His Gl - #u Gln Thr Ile Asn Leu     145                 1 - #50                 1 - #55                 1 -     #60     - Thr Ile Arg Asp Ser Val Ala His Asp Asn Se - #r Leu Asp Gly Phe Val     #               175     - Ala Asp Tyr Gln Val Gly Gly Val Phe Glu As - #n Asn Val Ser Tyr Asn     #           190     - Asn Asp Arg His Gly Phe Asn Ile Val Thr Se - #r Thr Asn Asp Phe Val     #       205     - Leu Ser Asn Asn Val Ala Tyr Gly Asn Gly Gl - #y Ala Gly Leu Val Val     #   220     - Gln Arg Gly Ser Tyr Asp Leu Pro His Pro Ty - #r Asp Ile Leu Ile Asp     225                 2 - #30                 2 - #35                 2 -     #40     - Gly Gly Ala Tyr Tyr Asp Asn Ala Leu Glu Gl - #y Val Gln Leu Lys Met     #               255     - Ala His Asp Val Thr Leu Gln Asn Ala Glu Il - #e Tyr Gly Asn Gly Leu     #           270     - Tyr Gly Val Arg Val Tyr Gly Ala Gln Asp Va - #l Gln Ile Leu Asp Asn     #       285     - Gln Ile His Asp Asn Ser Gln Asn Gly Ala Ty - #r Ala Glu Val Leu Leu     #   300     - Gln Ser Tyr Asp Asp Thr Ala Gly Val Ser Gl - #y Asn Phe Tyr Val Thr     305                 3 - #10                 3 - #15                 3 -     #20     - Thr Gly Thr Trp Leu Glu Gly Asn Val Ile Se - #r Gly Ser Ala Asn Ser     #               335     - Thr Tyr Gly Ile Gln Glu Arg Ala Asp Gly Th - #r Asp Tyr Ser Ser Leu     #           350     - Tyr Ala Asn Ser Ile Asp Gly Val Gln Thr Gl - #y Ala Val Arg Leu Tyr     #       365     - Gly Ala Asn Ser Thr Val Ser Ser Gln Ser Gl - #y Ser Gly Gln Gln Ala     #   380     - Thr Leu Glu Gly Ser Ala Gly Asn Asp Ala Le - #u Ser Gly Thr Glu Ala     385                 3 - #90                 3 - #95                 4 -     #00     - His Glu Thr Leu Leu Gly Gln Ala Gly Asp As - #p Arg Leu Asn Gly Asp     #               415     - Ala Gly Asn Asp Ile Leu Asp Gly Gly Ala Gl - #y Arg Asp Asn Leu Thr     #           430     - Gly Gly Ala Gly Ala Asp Thr Phe Arg Phe Se - #r Ala Arg Thr Asp Ser     #       445     - Tyr Arg Thr Asp Ser Ala Ser Phe Asn Asp Le - #u Ile Thr Asp Phe Asp     #   460     - Ala Asp Glu Asp Ser Ile Asp Leu Ser Ala Le - #u Gly Phe Thr Gly Leu     465                 4 - #70                 4 - #75                 4 -     #80     - Gly Asp Gly Tyr Asn Gly Thr Leu Leu Leu Ly - #s Thr Asn Ala Glu Gly     #               495     - Thr Arg Thr Tyr Leu Lys Ser Tyr Glu Ala As - #p Ala Gln Gly Arg Arg     #           510     - Phe Glu Ile Ala Leu Asp Gly Asn Phe Thr Gl - #y Leu Phe Asn Asp Asn     #       525     - Asn Leu Leu Phe Asp Ala Ala Pro Ala Thr Gl - #y Thr Glu Gly Ser Asp     #   540     - Asn Leu Leu Gly Thr Asp Ala Gly Glu Thr Le - #u Leu Gly Tyr Gly Gly     545                 5 - #50                 5 - #55                 5 -     #60     - Asn Asp Thr Leu Asn Gly Gly Ala Gly Asp As - #p Ile Leu Val Gly Gly     #               575     - Ala Gly Arg Asp Ser Leu Thr Gly Gly Ala Gl - #y Ala Asp Val Phe Arg     #           590     - Phe Asp Ala Leu Ser Asp Ser Gln Arg Asn Ty - #r Thr Thr Gly Asp Asn     #       605     - Gln Ala Asp Arg Ile Leu Asp Phe Asp Pro Th - #r Leu Asp Arg Ile Asp     #   620     - Val Ser Ala Leu Gly Phe Thr Gly Leu Gly As - #n Gly Arg Asn Gly Thr     625                 6 - #30                 6 - #35                 6 -     #40     - Leu Ala Val Val Leu Asn Ser Ala Gly Asp Ar - #g Thr Asp Leu Lys Ser     #               655     - Tyr Asp Thr Asp Ala Asn Gly Tyr Ser Phe Gl - #u Leu Ser Leu Ala Gly     #           670     - Asn Tyr Gln Gly Gln Leu Ser Ala Glu Gln Ph - #e Val Phe Ala Thr Ser     #       685     - Gln Gly Gly Gln Met Thr Ile Ile Glu Gly Th - #r Asp Gly Asn Asp Thr     #   700     - Leu Gln Gly Thr Glu Ala Asn Glu Arg Leu Le - #u Gly Leu Asp Gly Arg     705                 7 - #10                 7 - #15                 7 -     #20     - Asp Asn Leu Asn Gly Gly Ala Gly Asp Asp Il - #e Leu Asp Gly Gly Ala     #               735     - Gly Arg Asp Thr Leu Thr Gly Gly Thr Gly Al - #a Asp Thr Phe Leu Phe     #           750     - Ser Thr Arg Thr Asp Ser Tyr Arg Thr Asp Se - #r Ala Ser Phe Asn Asp     #       765     - Leu Ile Thr Asp Phe Asp Pro Thr Gln Asp Ar - #g Ile Asp Leu Ser Gly     #   780     - Leu Gly Phe Ser Gly Phe Gly Asn Gly Tyr As - #p Gly Thr Leu Leu Leu     785                 7 - #90                 7 - #95                 8 -     #00     - Gln Val Asn Ala Ala Gly Thr Arg Thr Tyr Le - #u Lys Ser Phe Glu Ala     #               815     - Asp Ala Asn Gly Gln Arg Phe Glu Ile Ala Le - #u Asp Gly Asp Phe Ser     #           830     - Gly Gln Leu Asp Ser Gly Asn Val Ile Phe Gl - #u Pro Ala Val Phe Asn     #       845     - Ala Lys Asp Phe Gly Ala Leu Gly Asp Gly Al - #a Ser Asp Asp Arg Pro     #   860     - Ala Ile Gln Ala Ala Ile Asp Ala Ala Tyr Al - #a Ala Gly Gly Gly Thr     865                 8 - #70                 8 - #75                 8 -     #80     - Val Tyr Leu Pro Ala Gly Glu Tyr Arg Val Se - #r Pro Thr Gly Glu Pro     #               895     - Gly Asp Gly Cys Leu Met Leu Lys Asp Gly Va - #l Tyr Leu Ala Gly Asp     #           910     - Gly Ile Gly Glu Thr Val Ile Lys Leu Ile As - #p Gly Ser Asp Gln Lys     #       925     - Ile Thr Gly Met Val Arg Ser Ala Tyr Gly Gl - #u Glu Thr Ser Asn Phe     #   940     - Gly Met Ser Asp Leu Thr Leu Asp Gly Asn Ar - #g Asp Asn Thr Ser Gly     945                 9 - #50                 9 - #55                 9 -     #60     - Lys Val Asp Gly Trp Phe Asn Gly Tyr Ile Pr - #o Gly Gln Asp Gly Ala     #               975     - Asp Arg Asn Val Thr Ile Glu Arg Val Glu Il - #e Arg Glu Met Ser Gly     #           990     - Tyr Gly Phe Asp Pro His Glu Gln Thr Ile As - #n Leu Thr Ile Arg Asp     #      10050     - Ser Val Ala His Asp Asn Gly Leu Asp Gly Ph - #e Val Ala Asp Tyr Leu     #  10205     - Val Asp Ser Val Phe Glu Asn Asn Val Ala Ty - #r Asn Asn Asp Arg His     #               10401030 - #                1035     - Gly Phe Asn Ile Val Thr Ser Thr Tyr Asp Ph - #e Val Met Thr Asn Asn     #              10550     - Val Ala Tyr Gly Asn Gly Gly Ala Gly Leu Th - #r Ile Gln Arg Gly Ser     #          10705     - Glu Asp Leu Ala Gln Pro Thr Asp Ile Leu Il - #e Asp Gly Gly Ala Tyr     #      10850     - Tyr Asp Asn Ala Leu Glu Gly Val Leu Phe Ly - #s Met Thr Asn Asn Val     #  11005     - Thr Leu Gln Asn Ala Glu Ile Tyr Gly Asn Gl - #y Ser Ser Gly Val Arg     #               11201110 - #                1115     - Leu Tyr Gly Thr Glu Asp Val Gln Ile Leu As - #p Asn Gln Ile His Asp     #              11350     - Asn Ser Gln Asn Gly Thr Tyr Pro Glu Val Le - #u Leu Gln Ala Phe Asp     #          11505     - Asp Ser Gln Val Thr Gly Glu Leu Tyr Glu Th - #r Leu Asn Thr Arg Ile     #      11650     - Glu Gly Asn Leu Ile Asp Ala Ser Asp Asn Al - #a Asn Tyr Ala Val Arg     #  11805     - Glu Arg Asp Asp Gly Ser Asp Tyr Thr Thr Le - #u Val Asp Asn Asp Ile     #               12001190 - #                1195     - Ser Gly Gly Gln Val Ala Ser Val Gln Leu Se - #r Gly Ala His Ser Ser     #              12150     - Leu Ser Gly Gly Thr Val Glu Val Pro Gln Gl - #y Thr Asp Gly Asn Asp     #          12305     - Val Leu Val Gly Ser Asp Ala Asn Asp Gln Le - #u Tyr Gly Gly Ala Gly     #      12450     - Asp Asp Arg Leu Asp Gly Gly Ala Gly Asp As - #p Leu Leu Asp Gly Gly     #  12605     - Ala Gly Arg Asp Asp Leu Thr Gly Gly Thr Gl - #y Ala Asp Thr Phe Val     #               12801270 - #                1275     - Phe Ala Ala Arg Thr Asp Ser Tyr Arg Thr As - #p Ala Gly Val Phe Asn     #              12950     - Asp Leu Ile Leu Asp Phe Asp Ala Ser Glu As - #p Arg Ile Asp Leu Ser     #          13105     - Ala Leu Gly Phe Ser Gly Phe Gly Asp Gly Ty - #r Asn Gly Thr Leu Leu     #      13250     - Val Gln Leu Ser Ser Ala Gly Thr Arg Thr Ty - #r Leu Lys Ser Tyr Glu     #  13405     - Glu Asp Leu Glu Gly Arg Arg Phe Glu Val Al - #a Leu Asp Gly Asp His     #               13601350 - #                1355     - Thr Gly Asp Leu Ser Ala Ala Asn Val Val Ph - #e Ala Asp Asp Gly Ser     #              13750     - Ala Ala Val Ala Ser Ser Asp Pro Ala Ala Th - #r Gln Leu Glu Val Val     #          13905     - Gly Ser Ser Gly Thr Gln Thr Asp Gln Leu Al - #a     #       1400     - (2) INFORMATION FOR SEQ ID NO:4:     -      (i) SEQUENCE CHARACTERISTICS:               (A) LENGTH:  997 ami - #no acids               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:     - Met Asp Tyr Asn Val Lys Asp Phe Gly Ala Le - #u Gly Asp Gly Val Ser     #                 15     - Asp Asp Thr Ala Ala Ile Gln Ala Ala Ile As - #p Ala Ala Tyr Ala Ala     #             30     - Gly Gly Gly Thr Val Tyr Leu Pro Ala Gly Gl - #u Tyr Arg Val Ser Gly     #         45     - Gly Glu Glu Pro Ser Asp Gly Cys Leu Thr Il - #e Lys Ser Asn Val His     #     60     - Ile Val Gly Ala Gly Met Gly Glu Thr Val Il - #e Lys Leu Val Asp Gly     # 80     - Trp Asp Gln Asp Val Thr Gly Ile Val Arg Se - #r Ala Tyr Gly Glu Glu     #                 95     - Thr Ser Asn Phe Gly Met Ser Asp Leu Thr Le - #u Asp Gly Asn Arg Asp     #           110     - Asn Thr Ser Gly Lys Val Asp Gly Trp Phe As - #n Gly Tyr Ile Pro Gly     #       125     - Glu Asp Gly Ala Asp Arg Asp Val Thr Leu Gl - #u Arg Val Glu Ile Arg     #   140     - Glu Met Ser Gly Tyr Gly Phe Asp Pro His Gl - #u Gln Thr Ile Asn Leu     145                 1 - #50                 1 - #55                 1 -     #60     - Thr Ile Arg Asp Ser Val Ala His Asp Asn Gl - #y Leu Asp Gly Phe Val     #               175     - Ala Asp Phe Gln Ile Gly Gly Val Phe Glu As - #n Asn Val Ser Tyr Asn     #           190     - Asn Asp Arg His Gly Phe Asn Ile Val Thr Se - #r Thr Asn Asp Phe Val     #       205     - Leu Ser Asn Asn Val Ala Tyr Gly Asn Gly Gl - #y Ala Gly Leu Val Val     #   220     - Gln Arg Gly Ser Ser Asp Val Ala His Pro Ty - #r Asp Ile Leu Ile Asp     225                 2 - #30                 2 - #35                 2 -     #40     - Gly Gly Ala Tyr Tyr Asp Asn Gly Leu Glu Gl - #y Val Gln Ile Lys Met     #               255     - Ala His Asp Val Thr Leu Gln Asn Ala Glu Il - #e Tyr Gly Asn Gly Leu     #           270     - Tyr Gly Val Arg Val Tyr Gly Ala Glu Asp Va - #l Gln Ile Leu Asp Asn     #       285     - Tyr Ile His Asp Asn Ser Gln Asn Gly Ser Ty - #r Ala Glu Ile Leu Leu     #   300     - Gln Ser Tyr Asp Asp Thr Ala Gly Val Ser Gl - #y Asn Phe Tyr Thr Thr     305                 3 - #10                 3 - #15                 3 -     #20     - Thr Gly Thr Trp Ile Glu Gly Asn Thr Ile Va - #l Gly Ser Ala Asn Ser     #               335     - Thr Tyr Gly Ile Gln Glu Arg Asp Asp Gly Th - #r Asp Tyr Ser Ser Leu     #           350     - Tyr Ala Asn Ser Val Ser Asn Val Gln Asn Gl - #y Ser Val Arg Leu Tyr     #       365     - Gly Ala Asn Ser Val Val Ser Asp Leu Pro Gl - #y Thr Gly Gln Gln Ala     #   380     - Thr Leu Glu Gly Thr Ala Gly Asn Asp Thr Le - #u Gly Gly Ser Asp Ala     385                 3 - #90                 3 - #95                 4 -     #00     - His Glu Thr Leu Leu Gly Leu Asp Gly Asn As - #p Arg Leu Asn Gly Gly     #               415     - Ala Gly Asn Asp Ile Leu Asp Gly Gly Ala Gl - #y Arg Asp Asn Leu Thr     #           430     - Gly Gly Ala Gly Ala Asp Leu Phe Arg Val Se - #r Ala Arg Thr Asp Ser     #       445     - Tyr Arg Thr Asp Ser Ala Ser Phe Asn Asp Le - #u Ile Thr Asp Phe Asp     #   460     - Ala Ser Gln Asp Arg Ile Asp Leu Ser Ala Le - #u Gly Phe Thr Gly Leu     465                 4 - #70                 4 - #75                 4 -     #80     - Gly Asp Gly Tyr Asn Gly Thr Leu Leu Leu Gl - #n Val Ser Ala Asp Gly     #               495     - Ser Arg Thr Tyr Leu Lys Ser Leu Glu Ala As - #p Ala Glu Gly Arg Arg     #           510     - Phe Glu Ile Ala Leu Asp Gly Asn Phe Ala Gl - #y Leu Leu Gly Ala Gly     #       525     - Asn Leu Leu Phe Glu Arg Thr Ala Ile Glu Gl - #y Asp Ala Gly Asp Asn     #   540     - Ala Leu Leu Gly Thr Ser Ala Ala Glu Thr Le - #u Leu Gly His Ala Gly     545                 5 - #50                 5 - #55                 5 -     #60     - Asn Asp Thr Leu Asp Gly Gly Ala Gly Asp As - #p Ile Leu Val Gly Gly     #               575     - Ala Gly Arg Asp Ser Leu Thr Gly Gly Ala Gl - #y Ala Asp Val Phe Arg     #           590     - Phe Asp Ala Leu Ser Asp Ser Gln Arg Asn Ty - #r Asp Ile Gly Asp Asn     #       605     - Gln Gly Asp Arg Ile Ala Asp Phe Ala Val Gl - #y Glu Asp Lys Leu Asp     #   620     - Val Ser Ala Leu Gly Phe Thr Gly Leu Gly As - #p Gly Tyr Asn Gly Thr     625                 6 - #30                 6 - #35                 6 -     #40     - Leu Ala Leu Val Leu Asn Ser Ala Gly Asp Ar - #g Thr Tyr Val Lys Ser     #               655     - Tyr Glu Asn Gly Ala Asp Gly Tyr Arg Phe Gl - #u Phe Ser Leu Asp Gly     #           670     - Asn Tyr Leu Glu Leu Leu Gly Asn Glu Asp Ph - #e Ile Phe Ala Thr Pro     #       685     - Ser Gly Gln Gln Leu Leu Glu Gly Ser Ala Gl - #y Asn Asp Ser Leu Gln     #   700     - Gly Thr Ala Ala Asp Glu Val Ile His Gly Gl - #y Gly Gly Arg Asp Thr     705                 7 - #10                 7 - #15                 7 -     #20     - Leu Ala Gly Gly Ala Gly Ala Asp Val Phe Ar - #g Phe Ser Glu Leu Thr     #               735     - Asp Ser Tyr Arg Asp Ser Ala Ser Tyr Ala As - #p Leu Ile Thr Asp Phe     #           750     - Asp Ala Ser Glu Asp Arg Ile Asp Leu Ser Gl - #y Leu Gly Phe Ser Gly     #       765     - Leu Gly Asn Gly Tyr Gly Gly Thr Leu Ala Le - #u Gln Val Asn Ser Ala     #   780     - Gly Thr Arg Thr Tyr Leu Lys Ser Phe Glu Th - #r Asn Ala Ala Gly Glu     785                 7 - #90                 7 - #95                 8 -     #00     - Arg Phe Glu Ile Ala Leu Asp Gly Asp Leu Se - #r Ala Leu Gly Gly Ala     #               815     - Asn Leu Ile Leu Asp Ala Arg Thr Val Leu Al - #a Gly Gly Asp Gly Asn     #           830     - Asp Thr Leu Ser Gly Ser Ser Ala Ala Glu Gl - #u Leu Leu Gly Gly Val     #       845     - Gly Asn Asp Ser Leu Asp Gly Gly Ala Gly As - #n Asp Ile Leu Asp Gly     #   860     - Gly Ala Gly Arg Asp Thr Leu Ser Gly Gly Se - #r Gly Ser Asp Ile Phe     865                 8 - #70                 8 - #75                 8 -     #80     - Arg Phe Gly Gly Ala Leu Asp Ser Phe Arg As - #n Tyr Ala Ser Gly Thr     #               895     - Asn Gly Thr Asp Ser Ile Thr Asp Phe Thr Pr - #o Gly Glu Asp Leu Ile     #           910     - Asp Leu Ser Val Leu Gly Tyr Thr Gly Leu Gl - #y Asp Gly Tyr Asn Gly     #       925     - Thr Leu Ala Ile Val Leu Asn Asp Ala Gly Th - #r Lys Thr Tyr Leu Lys     #   940     - Asn Arg Glu Ser Asp Ala Glu Gly Asn Gln Ph - #e Glu Ile Ala Leu Glu     945                 9 - #50                 9 - #55                 9 -     #60     - Gly Asn His Ala Asp Gln Leu Asp Ala Ser As - #p Phe Ile Phe Ala Thr     #               975     - Ala Ala Ala Thr Thr Gly Ile Glu Val Val Gl - #y Gly Ser Gly Thr Gln     #           990     - Thr Asp Gln Leu Ala             995     - (2) INFORMATION FOR SEQ ID NO:5:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 872 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:     - Met Asp Phe Asn Val Lys Asp Phe Gly Ala Le - #u Gly Asp Gly Ala Ser     #                 15     - Asp Asp Thr Ala Ala Ile Gln Ala Ala Ile As - #p Ala Ala His Ala Ala     #             30     - Gly Gly Gly Thr Val Tyr Leu Pro Ala Gly Gl - #u Tyr Arg Val Ser Gly     #         45     - Gly Glu Glu Pro Ser Asp Gly Ala Leu Thr Il - #e Lys Ser Asn Val Tyr     #     60     - Ile Val Gly Ala Gly Met Gly Glu Thr Val Il - #e Lys Met Val Asp Gly     # 80     - Trp Thr Gln Asn Val Thr Gly Met Val Arg Se - #r Ala Tyr Gly Glu Glu     #                 95     - Thr Ser Asn Phe Gly Met Ser Asp Leu Thr Le - #u Asp Gly Asn Arg Asp     #           110     - Asn Leu Ser Ala Lys Val Asp Gly Trp Phe As - #n Gly Tyr Ile Pro Gly     #       125     - Gln Asp Gly Ala Asp Arg Asp Val Thr Leu Gl - #u Arg Val Glu Ile Arg     #   140     - Glu Met Ser Gly Tyr Gly Phe Asp Pro His Gl - #u Gln Thr Ile Asn Leu     145                 1 - #50                 1 - #55                 1 -     #60     - Thr Ile Arg Asp Ser Val Ala His Asp Asn Gl - #y Leu Asp Gly Phe Val     #               175     - Ala Asp Tyr Gln Val Gly Gly Val Phe Glu As - #n Asn Val Ser Tyr Asn     #           190     - Asn Asp Arg His Gly Phe Asn Ile Val Thr Se - #r Thr Asn Asp Phe Val     #       205     - Leu Ser Asn Asn Val Ala Tyr Gly Asn Gly Gl - #y Ala Gly Leu Val Val     #   220     - Gln Arg Gly Ser Tyr Asp Leu Pro His Pro Ty - #r Asp Ile Leu Ile Asp     225                 2 - #30                 2 - #35                 2 -     #40     - Gly Gly Ala Tyr Tyr Asp Asn Ala Leu Glu Gl - #y Val Gln Leu Lys Met     #               255     - Thr His Asp Val Thr Leu Gln Asn Ala Glu Il - #e Tyr Gly Asn Gly Leu     #           270     - Tyr Gly Val Arg Val Tyr Gly Ala Gln Asp Va - #l Gln Leu Leu Asp Asn     #       285     - Gln Ile His Asp Asn Ser Gln Asn Gly Ala Ty - #r Ala Glu Val Leu Leu     #   300     - Gln Ser Tyr Asp Asp Thr Ala Gly Val Ser Gl - #y Asn Phe Tyr Val Thr     305                 3 - #10                 3 - #15                 3 -     #20     - Thr Gly Thr Trp Leu Glu Gly Asn Val Ile Se - #r Gly Ser Ala Asn Ser     #               335     - Thr Phe Gly Ile Gln Glu Arg Ala Asp Gly Th - #r Asp Tyr Ser Ser Leu     #           350     - Tyr Ala Asn Thr Ile Asp Gly Val Gln Asn Gl - #y Thr Val Arg Leu Tyr     #       365     - Gly Ala Asn Ser Thr Val Ser Glu Gln Pro Se - #r Ser Gly Gln Gln Ala     #   380     - Thr Leu Glu Gly Thr Ala Gly Asn Asp Val Le - #u Ser Gly Thr Gly Ala     385                 3 - #90                 3 - #95                 4 -     #00     - His Glu Leu Ile Leu Gly Leu Ala Gly Asn As - #p Arg Leu Asp Gly Gly     #               415     - Ala Gly Asp Asp Thr Leu Asp Gly Gly Ala Gl - #y Arg Asp Thr Leu Thr     #           430     - Gly Gly Ala Gly Ala Asp Thr Phe Arg Phe Se - #r Ala Arg Glu Asp Ser     #       445     - His Arg Thr Asp Ser Ala Ser Phe Thr Asp Le - #u Ile Thr Asp Phe Asp     #   460     - Ala Ser Gln Asp Arg Ile Asp Leu Ser Ala Le - #u Gly Phe Thr Gly Leu     465                 4 - #70                 4 - #75                 4 -     #80     - Gly Asn Gly Tyr Asp Gly Thr Leu Ala Val Th - #r Thr Gly Ser Gly Gly     #               495     - Thr Arg Thr Tyr Leu Lys Ser Tyr Glu Val As - #p Ala Gln Gly Arg Arg     #           510     - Phe Glu Ile Ala Leu Asp Gly Asn Phe Val Gl - #y Gln Phe Asn Asp Gly     #       525     - Asn Leu Leu Phe Asp Ala Ala Pro Val Thr Gl - #y Thr Glu Gly Asn Asp     #   540     - Asn Leu Ser Gly Thr Asp Ala Gly Glu Thr Le - #u Leu Gly Tyr Gly Gly     545                 5 - #50                 5 - #55                 5 -     #60     - Asn Asp Thr Leu Asn Gly Gly Ala Gly Asn As - #p Ile Leu Val Gly Gly     #               575     - Ala Gly Arg Asp Thr Leu Thr Gly Gly Ala Gl - #y Ala Asp Val Phe Arg     #           590     - Phe Glu Ala Leu Ser Asp Ser Gln Arg Asn Ty - #r Thr Ala Gly Asp Asn     #       605     - Gln Gly Asp Tyr Ile Ile Asp Phe Ala Val Gl - #y Glu Asp Arg Ile Asp     #   620     - Val Ser Ala Leu Gly Tyr Thr Gly Leu Gly As - #n Gly Arg Asn Gly Thr     625                 6 - #30                 6 - #35                 6 -     #40     - Leu Ala Val Val Leu Asn Ser Ala Gly Asp Ar - #g Thr Tyr Val Lys Ser     #               655     - Tyr Asp Thr Asp Ala Asn Gly Tyr Asn Phe Gl - #u Leu Ser Leu Ala Gly     #           670     - Asn Tyr Gln Gly Leu Leu Gly Ala Glu Gln Ph - #e Val Phe Ala Thr Pro     #       685     - Pro Glu Gln Ala Thr Ile Glu Gly Thr Asp Gl - #y Asn Asp Ser Leu Gln     #   700     - Gly Thr Gly Ala Asp Glu Leu Leu Leu Gly Le - #u Gly Gly Arg Asp Ser     705                 7 - #10                 7 - #15                 7 -     #20     - Leu Asn Gly Gly Ala Gly Asp Asp Val Leu As - #p Gly Gly Ala Glu Arg     #               735     - Asp Thr Leu Thr Gly Gly Thr Gly Ala Asp Th - #r Phe Leu Phe Ser Ala     #           750     - Arg Thr Asp Ser Tyr Arg Thr Asp Ser Ala Se - #r Phe Thr Asp Leu Ile     #       765     - Thr Asp Phe Asp Pro Ala Gln Asp Arg Ile As - #p Leu Ser Gly Leu Gly     #   780     - Phe Ser Gly Phe Gly Asn Gly Tyr Asp Gly Th - #r Leu Leu Leu Gln Val     785                 7 - #90                 7 - #95                 8 -     #00     - Asn Ala Ala Gly Thr Arg Thr Tyr Leu Lys Se - #r Leu Glu Ala Asp Ala     #               815     - Asp Gly Gln Arg Phe Glu Ile Ala Leu Asp Gl - #y Asp Phe Ser Gly Gln     #           830     - Leu Asp Ser Gly Asn Val Ile Phe Glu Ala Gl - #y Val Phe Asn Ala Lys     #       845     - Asp Phe Gly Ala Leu Gly Asp Gly Ala Ser As - #p Asp Arg Pro Ala Ile     #   860     - Gln Ala Ala Ile Asp Ala Ala Tyr     865                 8 - #70     - (2) INFORMATION FOR SEQ ID NO:6:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 15 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: peptide     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:     - Met Asp Tyr Asn Val Lys Asp Phe Gly Ala Le - #u Gly Asp Gly Val     #                15     - (2) INFORMATION FOR SEQ ID NO:7:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 1155 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: DNA (genomic)     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:     - ATGGATTACA ACGTCAAGGA TTTCGGGGCG CTGGGCGATG GCGTCAGCGA CG - #ACACGGCG       60     - GCCATCCAGG CGGCGATCGA TGCCGCCTAC GCGGCCGGCG GCGGCACCGT CT - #ACCTGCCG      120     - GCCGGCGAGT ACCGGGTCAG CGGCGGCGAG GAGCCTTCCG ACGGCTGCCT GA - #CCATCAAG      180     - AGCAACGTCT ATATCGTCGG CGCCGGGATG GGCGAGACGG TGATCAAGCT GG - #TCGACGGC      240     - TGGGAGCAGA ACGTCACCGG CATGGTGCGC TCGGCCTACG GCGAGGAGAC CA - #GCAACTTC      300     - GGCATGAGCG ACCTGACCCT CGACGGCAAC CGCGACAACA CCAGCGGCAA GG - #TCGACGGC      360     - TGGTTCAACG GCTACATCCC CGGCCAGGAC GGCGCCGACC GCGACGTGAC CC - #TGGAGCGG      420     - GTGGAAATCC GCGAGATGTC CGGCTACGGT TTCGATCCGC ACGAGCAGAC CA - #TCAACCTG      480     - ACGATCCGCG ACAGCGTGGC CCACGACAAC GGCCTCGACG GCTTCGTCGC CG - #ACTACCAG      540     - GTCGGCGGGG TGTTCGAGAA CAACGTCTCG TACAACAACG ACCGCCACGG CT - #TCAACATC      600     - GTCACCAGCA CCAACGACTT CGTCCTGAGC AACAACGTCG CCTACGGCAA CG - #GCGGCGCC      660     - GGCCTGGTGG TGCAGCGCGG CTCGTACGAC CTGGCCCAGC CCTACGACAT CC - #TGATCGAC      720     - GGCGGCGCCT ACTACGACAA CGCCCTGGAA GGCGTGCAGC TCAAGATGAC CC - #ACGACGTC      780     - ACCCTGCAGA ACGCCGAGAT CTACGGCAAC GGCCTCTACG GGGTGCGCGT CT - #ACGGCGCC      840     - CAGGACGTGC AGATCCTCGA CAACCAGATC CACGACAATT CGCAGAACGG CG - #CCTATGCC      900     - GAAGTCCTGC TGCAGTCCTA CGACGACACC GCCGGGGTGT CCGGCAACTT CT - #ACGCCACC      960     - ACCGGCACCT GGATCGAAGG CAACATCATC AGCGGCTCGG CCAACTCCAC CT - #ACGGCATC     1020     - CAGGAGCGCG ACGACGGCAC CGACTACAGC AGCCTCTACG CCAACAACAT CG - #GCGGTGTG     1080     - CAGAACGGGT CGGTACGGCT GTACGGCGCC AACTCGACGG TTTCCGGCCA GC - #CCGGCACC     1140     #  1155     - (2) INFORMATION FOR SEQ ID NO:8:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 385 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: peptide     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:     - Met Asp Phe Asn Val Lys Asp Phe Gly Ala Le - #u Gly Asp Gly Ala Ser     #                15     - Asp Asp Thr Ala Ala Ile Gln Ala Ala Ile As - #p Ala Ala Tyr Ala Ala     #            30     - Gly Gly Gly Thr Val Tyr Leu Pro Ala Gly Gl - #u Tyr Arg Val Ser Gly     #        45     - Gly Glu Glu Pro Ser Asp Gly Cys Leu Thr Il - #e Lys Ser Asn Val Tyr     #    60     - Ile Val Gly Ala Gly Met Gly Glu Thr Val Il - #e Lys Leu Val Asp Gly     #80     - Trp Asp Gln Asn Val Thr Gly Met Val Arg Se - #r Ala Tyr Gly Glu Glu     #                95     - Thr Ser Asn Phe Gly Met Ser Asp Leu Thr Le - #u Asp Gly Asn Arg Asp     #           110     - Asn Thr Ser Gly Lys Val Asp Gly Trp Phe As - #n Gly Tyr Ile Pro Gly     #       125     - Gln Asp Gly Ala Asp Arg Asp Val Thr Leu Gl - #u Arg Val Glu Ile Arg     #   140     - Glu Met Ser Gly Tyr Gly Phe Asp Pro His Gl - #u Gln Thr Ile Asn Leu     145                 1 - #50                 1 - #55                 1 -     #60     - Thr Ile Arg Asp Ser Val Ala His Asp Asn Gl - #y Leu Asp Gly Phe Val     #               175     - Ala Asp Tyr Gln Val Gly Gly Val Phe Glu As - #n Asn Val Ser Tyr Asn     #           190     - Asn Asp Arg His Gly Phe Asn Ile Val Thr Se - #r Thr Asn Asp Phe Val     #       205     - Leu Ser Asn Asn Val Ala Tyr Gly Asn Gly Gl - #y Ala Gly Leu Val Val     #   220     - Gln Arg Gly Ser Glu Asp Leu Ala His Pro Ty - #r Asp Ile Leu Ile Asp     225                 2 - #30                 2 - #35                 2 -     #40     - Gly Gly Ala Tyr Tyr Asp Asn Ala Leu Glu Gl - #y Val Gln Leu Lys Met     #               255     - Thr His Asp Val Thr Leu Gln Asn Ala Glu Il - #e Tyr Gly Asn Gly Leu     #           270     - Tyr Gly Val Arg Val Tyr Gly Ala Gln Asp Va - #l Gln Ile Leu Asp Asn     #       285     - Gln Ile His Asp Asn Ser Gln Asn Gly Ala Ty - #r Ala Glu Val Leu Leu     #   300     - Gln Ser Tyr Asp Asp Thr Ala Gly Val Ser Gl - #y Asn Phe Tyr Thr Thr     305                 3 - #10                 3 - #15                 3 -     #20     - Thr Gly Thr Trp Ile Glu Gly Asn Thr Ile Se - #r Gly Ser Ala Asn Ser     #               335     - Thr Tyr Gly Ile Gln Glu Arg Ala Asp Gly Th - #r Asp Tyr Ser Ser Leu     #           350     - Tyr Ala Asn Asp Ile Asp Gly Val Gln Thr Gl - #y Ser Val Arg Leu Tyr     #       365     - Gly Ala Asn Ser Thr Val Ser Gly Gln Pro Gl - #y Ser Gly Gln Gln Ala     #   380     - Thr     - 385     - (2) INFORMATION FOR SEQ ID NO:9:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 459 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: DNA (genomic)     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:     - CTCGAAGGCA CCGACGGCAA CGACACGCTG CGCGGCACCG AGGCCGACGA GA - #CGCTCCTC       60     - GGCCAGGCCG GCAACGACCG CCTGAACGGC GGCGCCGGCG ACGACATCCT CG - #ACGGCGGC      120     - GCCGGGCGCG ACACCCTGAC CGGCGGCGCG GGCGCCGACA CCTTCCGCTT CT - #CCGCGCGG      180     - ACCGACAGCT ACCGCACCGA CAGCGCCGGC GACAGCTTCA ACGACCTGAT CA - #CCGACTTC      240     - GACGCCAGCG AGGACCGCAT CGACCTGTCC GCGCTGGGCT TCACCGGGCT GG - #GCGACGGC      300     - TACAACGGCA CCCTGCTGCT GGTGCTCAAC GCCGCCGGCA CCCGCACCTA CC - #TGAAGAGC      360     - TACGAGGCGG ACGCCGAGGG CCAGCGCTTC GAGATCGCCC TGGACGGCAA CT - #ACACCGGC      420     #   459            ACTT GGTCTTCGCC GCGGCCGCG     - (2) INFORMATION FOR SEQ ID NO:10:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 153 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: peptide     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:     - Leu Glu Gly Thr Asp Gly Asn Asp Thr Leu Se - #r Gly Thr Asp Ala His     #                15     - Glu Thr Leu Leu Gly Leu Ala Gly Asn Asp Ar - #g Leu Asn Gly Gly Ala     #            30     - Gly Asp Asp Ile Leu Asp Gly Gly Ala Gly Ar - #g Asp Thr Leu Thr Gly     #        45     - Gly Ala Gly Ala Asp Thr Phe Arg Phe Ser Al - #a Arg Thr Asp Ser Tyr     #    60     - Arg Thr Asp Ser Ala Gly Asp Ser Phe Asn As - #p Leu Ile Thr Asp Phe     #80     - Asp Ala Ser Glu Asp Arg Ile Asp Leu Ser Al - #a Leu Gly Phe Thr Gly     #                95     - Leu Gly Asp Gly Tyr Asn Gly Thr Leu Leu Le - #u Gln Leu Asn Ser Ala     #           110     - Gly Thr Arg Thr Tyr Leu Lys Ser Tyr Glu Al - #a Asp Ala Glu Gly Arg     #       125     - Arg Phe Glu Ile Ala Leu Asp Gly Asn Phe Th - #r Gly Leu Leu Gly Ala     #   140     - Glu Asn Phe Ile Phe Ala Thr Thr Pro     145                 1 - #50     - (2) INFORMATION FOR SEQ ID NO:11:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 9 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: peptide     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:     - Leu Xaa Gly Gly Ala Gly Xaa Asp Xaa     1               5     - (2) INFORMATION FOR SEQ ID NO:12:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 6 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: peptide     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:     - Ala Arg Arg Arg Arg Ser     1               5     - (2) INFORMATION FOR SEQ ID NO:13:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 7 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: peptide     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:     - Ala Arg Arg Arg Ala Arg Arg     1               5     - (2) INFORMATION FOR SEQ ID NO:14:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 4 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: peptide     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:     - Arg Ala Arg Ser     - (2) INFORMATION FOR SEQ ID NO:15:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 4 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: peptide     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:     - Ala Arg Arg Arg     1     - (2) INFORMATION FOR SEQ ID NO:16:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 20 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: other nucleic acid     #= "Oligonucleotide"PTION: /desc     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:     # 20               ARGA     - (2) INFORMATION FOR SEQ ID NO:17:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 1176 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: DNA (genomic)     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:     - ATGGATTACA ACGTCAAGGA TTTCGGTGCA TTGGGCGACG GCGTCAGCGA CG - #ACCGGGCC       60     - TCCATCCAGG CGGCGATCGA TGCCGCCTAC GCCGCCGGTG GCGGTACCGT CT - #ACCTGCCG      120     - GCCGGCGAGT ACCGGGTCAG CGCCGCCGGG GAGCCGGGCG ACGGCTGCCT GA - #TGCTCAAG      180     - GACGGCGTCT ACCTGGCCGG TGCCGGCATG GGCGAGACGG TGATCAAGCT GA - #TCGACGGC      240     - TCCGACCAGA AGATCACCGG CATGGTCCGC TCGGCCTACG GCGAGGAAAC CA - #GCAACTTC      300     - GGCATGCGCG ACCTGACCCT CGACGGCAAC CGCGACAACA CCAGCGGCAA GG - #TCGACGGC      360     - TGGTTCAACG GCTATATCCC CGGCGGGGAC GGCGCCGACC GCGACGTGAC CA - #TCGAGCGG      420     - GTGGAGGTCC GCGAGATGTC CGGCTACGGC TTCGACCCCC ACGAGCAGAC CA - #TCAACCTG      480     - ACGATCCGCG ACAGCGTGGC CCACGACAAC GGCCTCGACG GCTTCGTCGC CG - #ACTACCTG      540     - GTCGACAGCG TGTTCGAGAA CAACGTCGCC TACGCCAACG ACCGCCACGG CT - #TCAACGTG      600     - GTCACCAGCA CCCACGATTT CGTCATGACC AACAACGTCG CCTACGGCAA CG - #GCAGCAGC      660     - GGCCTGGTGG TGCAGCGGGG TCTGGAGGAC CTCGCGCTGC CCAGCAACAT CC - #TGATCGAC      720     - GGCGGCGCCT ACTACGACAA CGCCCGCGAA GGCGTGCTGC TCAAGATGAC CA - #GCGACATC      780     - ACCCTGCAGA ACGCCGATAT CCACGGCAAC GGCTCCTCCG GGGTGCGCGT CT - #ACGGCGCC      840     - CAGGACGTGC AGATCCTCGA TAACCAGATC CACGACAACG CGCAGGCGGC CG - #CCGTGCCC      900     - GAGGTCCTGC TGCAGTCCTT CGACGATACC GCCGGGGCGT CCGGCACCTA CT - #ACACGACC      960     - CTGAACACCC GGATCGAGGG CAACACCATC AGCGGCTCGG CCAACTCCAC CT - #ACGGCATC     1020     - CAGGAGCGCA ACGACGGCAC CGACTACAGC AGCCTGATCG ACAACGACAT CG - #CCGGGGTG     1080     - CAACAGCCCA TCCAACTGTA CGGACCTCAC TCGACGGTAT CCGGCGAACC CG - #GCGCGACA     1140     #     1176         CGGG AAGCGACGGC GAGCCA     - (2) INFORMATION FOR SEQ ID NO:18:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 1155 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: DNA (genomic)     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:     - ATGGATTACA ACGTCAAGGA TTTCGGAGCA CTGGGCGATG GCGTCAGCGA CG - #ACACGGCG       60     - GCCATCCAGG CGGCGATCGA CGCCGCCCAC GCGGCGGGCG GCGGCACCGT CT - #ACCTGCCG      120     - GCCGGCGAAT ATCGGGTCAG CGGCGGCGAG GAGCCTTCCG ATGGTTGTCT GA - #CCATCAAG      180     - AGCAACGTCC ATATCGTCGG CGCCGGGATG GGCGAGACGG TGATCAAGAT GG - #TCGACGGC      240     - TGGACGCAGA ACGTCACCGG CATGGTGCGC TCGGCCTACG GCGAGGAAAC CA - #GCAACTTC      300     - GGCATGAGCG ACCTGACCCT CGACGGCAAC CGCGACAACC TGTCCGCCAA GG - #TCGACGGC      360     - TGGTTCAACG GCTACATCCC CGGCCAGGAC GGCGCCGATC GCGACGTGAC CC - #TGGAGCGG      420     - GTGGAAATCC GCGAGATGTC CGGCTACGGT TTCGACCCCC ACGAGCAGAC CA - #TCAACCTG      480     - ACGATCCGCG ACAGCGTGGC CCACGACAAC AGCCTCGACG GCTTCGTCGC CG - #ACTACCAG      540     - GTCGGCGGGG TGTTCGAGAA CAACGTCTCG TACAACAACG ACCGCCACGG CT - #TCAACATC      600     - GTCACCAGCA CCAACGACTT CGTCCTGAGC AACAACGTCG CCTACGGCAA CG - #GCGGCGCC      660     - GGCCTGGTGG TGCAGCGCGG CTCGTACGAC CTGCCCCATC CCTACGACAT CC - #TGATCGAC      720     - GGCGGCGCCT ACTACGACAA CGCCTTGGAA GGCGTGCAGC TCAAGATGGC CC - #ACGACGTC      780     - ACCCTGCAGA ACGCCGAGAT CTACGGCAAC GGCCTGTACG GGGTGCGCGT CT - #ACGGCGCC      840     - CAGGACGTGC AGATCCTCGA CAACCAGATC CACGACAATT CGCAGAACGG CG - #CCTATGCC      900     - GAAGTCCTGC TGCAGTCCTA CGACGACACC GCCGGGGTGT CCGGCAACTT TT - #ACGTCACC      960     - ACCGGCACCT GGCTCGAAGG CAACGTCATC AGCGGCTCGG CCAATTCCAC CT - #ACGGCATC     1020     - CAGGAGCGCG CCGACGGCAC CGACTACAGC AGCCTCTACG CCAACAGCAT CG - #ACGGTGTG     1080     - CAGACCGGGG CGGTACGGCT GTATGGCGCC AACTCGACGG TTTCCAGCCA GT - #CCGGCAGT     1140     #  1155     - (2) INFORMATION FOR SEQ ID NO:19:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 1143 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: DNA (genomic)     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:     - GTGTTCAATG CCAAGGACTT CGGCGCGCTG GGCGACGGCG CCAGCGACGA CC - #GGCCGGCC       60     - ATCCAGGCGG CGATCGACGC CGCCTACGCG GCCGGTGGCG GCACCGTCTA CC - #TGCCGGCC      120     - GGCGAGTACC GGGTCAGCCC CACCGGGGAG CCGGGCGACG GCTGCCTGAT GC - #TCAAGGAC      180     - GGCGTCTACC TGGCCGGCGA CGGCATAGGC GAAACGGTCA TCAAGCTGAT CG - #ACGGCTCC      240     - GACCAGAAGA TCACCGGCAT GGTGCGCTCG GCCTATGGCG AAGAGACCAG CA - #ACTTCGGC      300     - ATGAGCGACC TGACCCTCGA CGGCAACCGC GACAACACCA GCGGCAAGGT CG - #ACGGCTGG      360     - TTCAACGGCT ACATCCCCGG CCAGGACGGC GCCGACCGCA ACGTGACCAT CG - #AGCGGGTG      420     - GAAATCCGCG AGATGTCCGG CTATGGCTTC GATCCGCACG AGCAGACCAT CA - #ACCTGACG      480     - ATCCGCGACA GCGTGGCCCA CGACAACGGC CTCGACGGCT TCGTCGCCGA CT - #ACCTGGTC      540     - GACAGCGTGT TCGAGAACAA CGTCGCCTAC AACAACGACC GCCACGGCTT CA - #ACATCGTC      600     - ACCAGCACCT ACGATTTCGT CATGACCAAC AACGTCGCCT ACGGCAACGG CG - #GCGCCGGC      660     - CTGACGATCC AGCGGGGCTC GGAGGACCTG GCCCAGCCGA CCGATATCCT GA - #TCGACGGC      720     - GGCGCCTACT ACGACAACGC CCTGGAAGGC GTGCTGTTCA AGATGACCAA CA - #ACGTCACC      780     - CTGCAGAACG CCGAGATCTA CGGCAACGGC TCCTCCGGCG TGCGCCTGTA CG - #GCACGGAG      840     - GACGTGCAGA TCCTCGACAA CCAGATCCAC GACAATTCGC AGAACGGCAC CT - #ATCCGGAA      900     - GTCCTGCTGC AGGCCTTCGA CGACAGCCAG GTCACCGGTG AGCTGTACGA GA - #CCCTGAAC      960     - ACCCGGATCG AAGGCAATCT CATCGACGCT TCGGACAACG CCAACTATGC GG - #TGCGCGAG     1020     - CGCGACGACG GCAGCGACTA CACCACGCTC GTGGACAACG ACATCAGCGG CG - #GCCAGGTC     1080     - GCCTCGGTGC AGCTTTCCGG CGCCCATTCG AGTCTTTCCG GCGGCACCGT CG - #AAGTGCCG     1140     #           1143     - (2) INFORMATION FOR SEQ ID NO:20:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 1155 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: DNA (genomic)     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:     - ATGGATTACA ACGTCAAAGA TTTCGGGGCG CTGGGCGATG GCGTCAGCGA CG - #ATACGGCC       60     - GCCATCCAGG CGGCGATCGA TGCCGCCTAC GCGGCCGGCG GCGGCACCGT CT - #ACCTGCCG      120     - GCCGGCGAAT ACCGGGTCAG CGGCGGCGAG GAGCCTTCCG ATGGTTGCCT GA - #CCATCAAG      180     - AGCAACGTCC ATATCGTCGG CGCGGGGATG GGCGAGACGG TCATCAAGCT GG - #TCGACGGC      240     - TGGGATCAGG ACGTCACCGG CATCGTCCGC TCGGCCTACG GCGAGGAGAC CA - #GCAACTTC      300     - GGCATGAGCG ACCTGACCCT CGACGGCAAC CGCGACAACA CCAGCGGCAA GG - #TCGACGGC      360     - TGGTTCAACG GCTACATTCC CGGCGAGGAC GGCGCCGACC GCGACGTGAC CC - #TGGAGCGG      420     - GTGGAAATCC GTGAAATGTC CGGTTACGGT TTCGATCCGC ACGAGCAGAC CA - #TCAACCTG      480     - ACGATCCGCG ACAGCGTGGC CCACGACAAC GGCCTCGACG GCTTCGTCGC CG - #ATTTCCAG      540     - ATCGGCGGGG TGTTCGAGAA CAACGTCTCG TACAACAACG ACCGCCACGG CT - #TCAACATC      600     - GTCACCAGCA CCAACGACTT CGTCCTGAGC AACAACGTCG CCTACGGCAA CG - #GCGGCGCC      660     - GGCCTGGTGG TGCAGCGCGG CTCGTCCGAC GTGGCGCACC CCTACGACAT CC - #TGATCGAC      720     - GGCGGCGCCT ACTACGACAA CGGCCTGGAA GGCGTGCAGA TCAAGATGGC CC - #ACGACGTC      780     - ACCCTGCAGA ACGCCGAGAT CTACGGCAAC GGCCTATACG GGGTGCGCGT CT - #ACGGCGCC      840     - GAGGATGTGC AGATCCTCGA CAACTACATC CACGACAATT CGCAGAACGG TT - #CCTACGCG      900     - GAAATCCTCC TGCAGTCCTA CGACGATACC GCCGGGGTGT CCGGCAATTT CT - #ACACCACC      960     - ACCGGCACCT GGATCGAAGG CAACACCATC GTCGGCTCGG CCAACTCCAC CT - #ATGGCATC     1020     - CAGGAGCGCG ACGACGGCAC CGACTACAGC AGCCTCTACG CCAACAGCGT CA - #GCAATGTG     1080     - CAGAACGGCT CGGTGCGCCT CTACGGCGCC AACTCCGTCG TCTCCGACCT GC - #CCGGCACC     1140     #  1155     - (2) INFORMATION FOR SEQ ID NO:21:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 1155 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: DNA (genomic)     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:     - ATGGACTTCA ACGTCAAAGA TTTCGGGGCA CTGGGCGATG GCGCCAGCGA CG - #ACACGGCG       60     - GCCATCCAGG CGGCGATCGA TGCCGCCCAC GCGGCGGGCG GCGGCACCGT CT - #ACCTGCCG      120     - GCTGGCGAGT ATCGGGTCAG CGGCGGCGAG GAGCCTTCCG ACGGCGCGCT GA - #CCATCAAG      180     - AGCAACGTCT ATATCGTCGG CGCCGGGATG GGCGAGACGG TGATCAAGAT GG - #TCGACGGC      240     - TGGACGCAGA ACGTCACCGG CATGGTGCGC TCGGCCTATG GCGAGGAGAC CA - #GCAACTTC      300     - GGCATGAGCG ACCTGACCCT CGACGGCAAC CGCGACAACC TGTCCGCCAA GG - #TCGACGGC      360     - TGGTTCAACG GCTACATTCC CGGCCAGGAC GGTGCCGATC GCGACGTGAC CC - #TGGAGCGG      420     - GTGGAAATCC GCGAAATGTC CGGTTACGGT TTCGATCCGC ACGAGCAGAC CA - #TCAACCTG      480     - ACGATCCGCG ACAGCGTGGC CCACGACAAC GGCCTCGACG GCTTCGTCGC CG - #ACTACCAG      540     - GTCGGCGGGG TGTTCGAGAA CAACGTCTCG TACAACAACG ACCGCCACGG CT - #TCAACATC      600     - GTCACCAGCA CCAACGACTT CGTCCTGAGC AACAACGTCG CCTACGGCAA CG - #GCGGCGCC      660     - GGCCTGGTGG TGCAGCGCGG CTCGTACGAC CTGCCCCATC CCTACGACAT CC - #TGATCGAC      720     - GGCGGCGCCT ACTACGACAA CGCCTTGGAA GGCGTGCAGC TCAAGATGAC CC - #ACGACGTC      780     - ACCCTGCAGA ACGCCGAGAT CTATGGCAAC GGCCTGTACG GGGTGCGCGT CT - #ACGGCGCC      840     - CAGGACGTGC AACTCCTCGA TAACCAGATC CACGACAATT CGCAGAACGG CG - #CCTATGCC      900     - GAAGTCCTGC TGCAGTCCTA CGACGACACC GCCGGGGTGT CCGGCAACTT TT - #ACGTCACC      960     - ACCGGCACCT GGCTCGAAGG CAACGTCATC AGCGGCTCGG CCAATTCCAC CT - #TCGGCATC     1020     - CAGGAGCGCG CCGACGGCAC CGACTACAGC AGCCTTTACG CCAATACCAT CG - #ACGGCGTG     1080     - CAGAACGGGA CGGTACGGCT GTATGGCGCC AACTCCACGG TTTCCGAGCA GC - #CCAGCAGC     1140     #  1155     - (2) INFORMATION FOR SEQ ID NO:22:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 87 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: DNA (genomic)     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:22:     - GTGTTCAATG CCAAGGACTT CGGCGCGCTG GGCGACGGCG CCAGCGACGA CC - #GGCCGGCC       60     #             87   ACGC CGCCTAC     - (2) INFORMATION FOR SEQ ID NO:23:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 392 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:23:     - Met Asp Tyr Asn Val Lys Asp Phe Gly Ala Le - #u Gly Asp Gly Val Ser     #                15     - Asp Asp Arg Ala Ser Ile Gln Ala Ala Ile As - #p Ala Ala Tyr Ala Ala     #            30     - Gly Gly Gly Thr Val Tyr Leu Pro Ala Gly Gl - #u Tyr Arg Val Ser Ala     #        45     - Ala Gly Glu Pro Gly Asp Gly Cys Leu Met Le - #u Lys Asp Gly Val Tyr     #    60     - Leu Ala Gly Ala Gly Met Gly Glu Thr Val Il - #e Lys Leu Ile Asp Gly     #80     - Ser Asp Gln Lys Ile Thr Gly Met Val Arg Se - #r Ala Tyr Gly Glu Glu     #                95     - Thr Ser Asn Phe Gly Met Arg Asp Leu Thr Le - #u Asp Gly Asn Arg Asp     #           110     - Asn Thr Ser Gly Lys Val Asp Gly Trp Phe As - #n Gly Tyr Ile Pro Gly     #       125     - Gly Asp Gly Ala Asp Arg Asp Val Thr Ile Gl - #u Arg Val Glu Val Arg     #   140     - Glu Met Ser Gly Tyr Gly Phe Asp Pro His Gl - #u Gln Thr Ile Asn Leu     145                 1 - #50                 1 - #55                 1 -     #60     - Thr Ile Arg Asp Ser Val Ala His Asp Asn Gl - #y Leu Asp Gly Phe Val     #               175     - Ala Asp Tyr Leu Val Asp Ser Val Phe Glu As - #n Asn Val Ala Tyr Ala     #           190     - Asn Asp Arg His Gly Phe Asn Val Val Thr Se - #r Thr His Asp Phe Val     #       205     - Met Thr Asn Asn Val Ala Tyr Gly Asn Gly Se - #r Ser Gly Leu Val Val     #   220     - Gln Arg Gly Leu Glu Asp Leu Ala Leu Pro Se - #r Asn Ile Leu Ile Asp     225                 2 - #30                 2 - #35                 2 -     #40     - Gly Gly Ala Tyr Tyr Asp Asn Ala Arg Glu Gl - #y Val Leu Leu Lys Met     #               255     - Thr Ser Asp Ile Thr Leu Gln Asn Ala Asp Il - #e His Gly Asn Gly Ser     #           270     - Ser Gly Val Arg Val Tyr Gly Ala Gln Asp Va - #l Gln Ile Leu Asp Asn     #       285     - Gln Ile His Asp Asn Ala Gln Ala Ala Ala Va - #l Pro Glu Val Leu Leu     #   300     - Gln Ser Phe Asp Asp Thr Ala Gly Ala Ser Gl - #y Thr Tyr Tyr Thr Thr     305                 3 - #10                 3 - #15                 3 -     #20     - Leu Asn Thr Arg Ile Glu Gly Asn Thr Ile Se - #r Gly Ser Ala Asn Ser     #               335     - Thr Tyr Gly Ile Gln Glu Arg Asn Asp Gly Th - #r Asp Tyr Ser Ser Leu     #           350     - Ile Asp Asn Asp Ile Ala Gly Val Gln Gln Pr - #o Ile Gln Leu Tyr Gly     #       365     - Pro His Ser Thr Val Ser Gly Glu Pro Gly Al - #a Thr Pro Gln Gln Pro     #   380     - Ser Thr Gly Ser Asp Gly Glu Pro     385                 3 - #90     - (2) INFORMATION FOR SEQ ID NO:24:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 385 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:24:     - Met Asp Tyr Asn Val Lys Asp Phe Gly Ala Le - #u Gly Asp Gly Val Ser     #                15     - Asp Asp Thr Ala Ala Ile Gln Ala Ala Ile As - #p Ala Ala His Ala Ala     #            30     - Gly Gly Gly Thr Val Tyr Leu Pro Ala Gly Gl - #u Tyr Arg Val Ser Gly     #        45     - Gly Glu Glu Pro Ser Asp Gly Cys Leu Thr Il - #e Lys Ser Asn Val His     #    60     - Ile Val Gly Ala Gly Met Gly Glu Thr Val Il - #e Lys Met Val Asp Gly     #80     - Trp Thr Gln Asn Val Thr Gly Met Val Arg Se - #r Ala Tyr Gly Glu Glu     #                95     - Thr Ser Asn Phe Gly Met Ser Asp Leu Thr Le - #u Asp Gly Asn Arg Asp     #           110     - Asn Leu Ser Ala Lys Val Asp Gly Trp Phe As - #n Gly Tyr Ile Pro Gly     #       125     - Gln Asp Gly Ala Asp Arg Asp Val Thr Leu Gl - #u Arg Val Glu Ile Arg     #   140     - Glu Met Ser Gly Tyr Gly Phe Asp Pro His Gl - #u Gln Thr Ile Asn Leu     145                 1 - #50                 1 - #55                 1 -     #60     - Thr Ile Arg Asp Ser Val Ala His Asp Asn Se - #r Leu Asp Gly Phe Val     #               175     - Ala Asp Tyr Gln Val Gly Gly Val Phe Glu As - #n Asn Val Ser Tyr Asn     #           190     - Asn Asp Arg His Gly Phe Asn Ile Val Thr Se - #r Thr Asn Asp Phe Val     #       205     - Leu Ser Asn Asn Val Ala Tyr Gly Asn Gly Gl - #y Ala Gly Leu Val Val     #   220     - Gln Arg Gly Ser Tyr Asp Leu Pro His Pro Ty - #r Asp Ile Leu Ile Asp     225                 2 - #30                 2 - #35                 2 -     #40     - Gly Gly Ala Tyr Tyr Asp Asn Ala Leu Glu Gl - #y Val Gln Leu Lys Met     #               255     - Ala His Asp Val Thr Leu Gln Asn Ala Glu Il - #e Tyr Gly Asn Gly Leu     #           270     - Tyr Gly Val Arg Val Tyr Gly Ala Gln Asp Va - #l Gln Ile Leu Asp Asn     #       285     - Gln Ile His Asp Asn Ser Gln Asn Gly Ala Ty - #r Ala Glu Val Leu Leu     #   300     - Gln Ser Tyr Asp Asp Thr Ala Gly Val Ser Gl - #y Asn Phe Tyr Val Thr     305                 3 - #10                 3 - #15                 3 -     #20     - Thr Gly Thr Trp Leu Glu Gly Asn Val Ile Se - #r Gly Ser Ala Asn Ser     #               335     - Thr Tyr Gly Ile Gln Glu Arg Ala Asp Gly Th - #r Asp Tyr Ser Ser Leu     #           350     - Tyr Ala Asn Ser Ile Asp Gly Val Gln Thr Gl - #y Ala Val Arg Leu Tyr     #       365     - Gly Ala Asn Ser Thr Val Ser Ser Gln Ser Gl - #y Ser Gly Gln Gln Ala     #   380     - Thr     - 385     - (2) INFORMATION FOR SEQ ID NO:25:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 381 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:25:     - Val Phe Asn Ala Lys Asp Phe Gly Ala Leu Gl - #y Asp Gly Ala Ser Asp     #                15     - Asp Arg Pro Ala Ile Gln Ala Ala Ile Asp Al - #a Ala Tyr Ala Ala Gly     #            30     - Gly Gly Thr Val Tyr Leu Pro Ala Gly Glu Ty - #r Arg Val Ser Pro Thr     #        45     - Gly Glu Pro Gly Asp Gly Cys Leu Met Leu Ly - #s Asp Gly Val Tyr Leu     #    60     - Ala Gly Asp Gly Ile Gly Glu Thr Val Ile Ly - #s Leu Ile Asp Gly Ser     #80     - Asp Gln Lys Ile Thr Gly Met Val Arg Ser Al - #a Tyr Gly Glu Glu Thr     #                95     - Ser Asn Phe Gly Met Ser Asp Leu Thr Leu As - #p Gly Asn Arg Asp Asn     #           110     - Thr Ser Gly Lys Val Asp Gly Trp Phe Asn Gl - #y Tyr Ile Pro Gly Gln     #       125     - Asp Gly Ala Asp Arg Asn Val Thr Ile Glu Ar - #g Val Glu Ile Arg Glu     #   140     - Met Ser Gly Tyr Gly Phe Asp Pro His Glu Gl - #n Thr Ile Asn Leu Thr     145                 1 - #50                 1 - #55                 1 -     #60     - Ile Arg Asp Ser Val Ala His Asp Asn Gly Le - #u Asp Gly Phe Val Ala     #               175     - Asp Tyr Leu Val Asp Ser Val Phe Glu Asn As - #n Val Ala Tyr Asn Asn     #           190     - Asp Arg His Gly Phe Asn Ile Val Thr Ser Th - #r Tyr Asp Phe Val Met     #       205     - Thr Asn Asn Val Ala Tyr Gly Asn Gly Gly Al - #a Gly Leu Thr Ile Gln     #   220     - Arg Gly Ser Glu Asp Leu Ala Gln Pro Thr As - #p Ile Leu Ile Asp Gly     225                 2 - #30                 2 - #35                 2 -     #40     - Gly Ala Tyr Tyr Asp Asn Ala Leu Glu Gly Va - #l Leu Phe Lys Met Thr     #               255     - Asn Asn Val Thr Leu Gln Asn Ala Glu Ile Ty - #r Gly Asn Gly Ser Ser     #           270     - Gly Val Arg Leu Tyr Gly Thr Glu Asp Val Gl - #n Ile Leu Asp Asn Gln     #       285     - Ile His Asp Asn Ser Gln Asn Gly Thr Tyr Pr - #o Glu Val Leu Leu Gln     #   300     - Ala Phe Asp Asp Ser Gln Val Thr Gly Glu Le - #u Tyr Glu Thr Leu Asn     305                 3 - #10                 3 - #15                 3 -     #20     - Thr Arg Ile Glu Gly Asn Leu Ile Asp Ala Se - #r Asp Asn Ala Asn Tyr     #               335     - Ala Val Arg Glu Arg Asp Asp Gly Ser Asp Ty - #r Thr Thr Leu Val Asp     #           350     - Asn Asp Ile Ser Gly Gly Gln Val Ala Ser Va - #l Gln Leu Ser Gly Ala     #       365     - His Ser Ser Leu Ser Gly Gly Thr Val Glu Va - #l Pro Gln     #   380     - (2) INFORMATION FOR SEQ ID NO:26:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 385 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:26:     - Met Asp Tyr Asn Val Lys Asp Phe Gly Ala Le - #u Gly Asp Gly Val Ser     #                15     - Asp Asp Thr Ala Ala Ile Gln Ala Ala Ile As - #p Ala Ala Tyr Ala Ala     #            30     - Gly Gly Gly Thr Val Tyr Leu Pro Ala Gly Gl - #u Tyr Arg Val Ser Gly     #        45     - Gly Glu Glu Pro Ser Asp Gly Cys Leu Thr Il - #e Lys Ser Asn Val His     #    60     - Ile Val Gly Ala Gly Met Gly Glu Thr Val Il - #e Lys Leu Val Asp Gly     #80     - Trp Asp Gln Asp Val Thr Gly Ile Val Arg Se - #r Ala Tyr Gly Glu Glu     #                95     - Thr Ser Asn Phe Gly Met Ser Asp Leu Thr Le - #u Asp Gly Asn Arg Asp     #           110     - Asn Thr Ser Gly Lys Val Asp Gly Trp Phe As - #n Gly Tyr Ile Pro Gly     #       125     - Glu Asp Gly Ala Asp Arg Asp Val Thr Leu Gl - #u Arg Val Glu Ile Arg     #   140     - Glu Met Ser Gly Tyr Gly Phe Asp Pro His Gl - #u Gln Thr Ile Asn Leu     145                 1 - #50                 1 - #55                 1 -     #60     - Thr Ile Arg Asp Ser Val Ala His Asp Asn Gl - #y Leu Asp Gly Phe Val     #               175     - Ala Asp Phe Gln Ile Gly Gly Val Phe Glu As - #n Asn Val Ser Tyr Asn     #           190     - Asn Asp Arg His Gly Phe Asn Ile Val Thr Se - #r Thr Asn Asp Phe Val     #       205     - Leu Ser Asn Asn Val Ala Tyr Gly Asn Gly Gl - #y Ala Gly Leu Val Val     #   220     - Gln Arg Gly Ser Ser Asp Val Ala His Pro Ty - #r Asp Ile Leu Ile Asp     225                 2 - #30                 2 - #35                 2 -     #40     - Gly Gly Ala Tyr Tyr Asp Asn Gly Leu Glu Gl - #y Val Gln Ile Lys Met     #               255     - Ala His Asp Val Thr Leu Gln Asn Ala Glu Il - #e Tyr Gly Asn Gly Leu     #           270     - Tyr Gly Val Arg Val Tyr Gly Ala Glu Asp Va - #l Gln Ile Leu Asp Asn     #       285     - Tyr Ile His Asp Asn Ser Gln Asn Gly Ser Ty - #r Ala Glu Ile Leu Leu     #   300     - Gln Ser Tyr Asp Asp Thr Ala Gly Val Ser Gl - #y Asn Phe Tyr Thr Thr     305                 3 - #10                 3 - #15                 3 -     #20     - Thr Gly Thr Trp Ile Glu Gly Asn Thr Ile Va - #l Gly Ser Ala Asn Ser     #               335     - Thr Tyr Gly Ile Gln Glu Arg Asp Asp Gly Th - #r Asp Tyr Ser Ser Leu     #           350     - Tyr Ala Asn Ser Val Ser Asn Val Gln Asn Gl - #y Ser Val Arg Leu Tyr     #       365     - Gly Ala Asn Ser Val Val Ser Asp Leu Pro Gl - #y Thr Gly Gln Gln Ala     #   380     - Thr     - 385     - (2) INFORMATION FOR SEQ ID NO:27:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 385 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:27:     - Met Asp Phe Asn Val Lys Asp Phe Gly Ala Le - #u Gly Asp Gly Ala Ser     #                15     - Asp Asp Thr Ala Ala Ile Gln Ala Ala Ile As - #p Ala Ala His Ala Ala     #            30     - Gly Gly Gly Thr Val Tyr Leu Pro Ala Gly Gl - #u Tyr Arg Val Ser Gly     #        45     - Gly Glu Glu Pro Ser Asp Gly Ala Leu Thr Il - #e Lys Ser Asn Val Tyr     #    60     - Ile Val Gly Ala Gly Met Gly Glu Thr Val Il - #e Lys Met Val Asp Gly     #80     - Trp Thr Gln Asn Val Thr Gly Met Val Arg Se - #r Ala Tyr Gly Glu Glu     #                95     - Thr Ser Asn Phe Gly Met Ser Asp Leu Thr Le - #u Asp Gly Asn Arg Asp     #           110     - Asn Leu Ser Ala Lys Val Asp Gly Trp Phe As - #n Gly Tyr Ile Pro Gly     #       125     - Gln Asp Gly Ala Asp Arg Asp Val Thr Leu Gl - #u Arg Val Glu Ile Arg     #   140     - Glu Met Ser Gly Tyr Gly Phe Asp Pro His Gl - #u Gln Thr Ile Asn Leu     145                 1 - #50                 1 - #55                 1 -     #60     - Thr Ile Arg Asp Ser Val Ala His Asp Asn Gl - #y Leu Asp Gly Phe Val     #               175     - Ala Asp Tyr Gln Val Gly Gly Val Phe Glu As - #n Asn Val Ser Tyr Asn     #           190     - Asn Asp Arg His Gly Phe Asn Ile Val Thr Se - #r Thr Asn Asp Phe Val     #       205     - Leu Ser Asn Asn Val Ala Tyr Gly Asn Gly Gl - #y Ala Gly Leu Val Val     #   220     - Gln Arg Gly Ser Tyr Asp Leu Pro His Pro Ty - #r Asp Ile Leu Ile Asp     225                 2 - #30                 2 - #35                 2 -     #40     - Gly Gly Ala Tyr Tyr Asp Asn Ala Leu Glu Gl - #y Val Gln Leu Lys Met     #               255     - Thr His Asp Val Thr Leu Gln Asn Ala Glu Il - #e Tyr Gly Asn Gly Leu     #           270     - Tyr Gly Val Arg Val Tyr Gly Ala Gln Asp Va - #l Gln Leu Leu Asp Asn     #       285     - Gln Ile His Asp Asn Ser Gln Asn Gly Ala Ty - #r Ala Glu Val Leu Leu     #   300     - Gln Ser Tyr Asp Asp Thr Ala Gly Val Ser Gl - #y Asn Phe Tyr Val Thr     305                 3 - #10                 3 - #15                 3 -     #20     - Thr Gly Thr Trp Leu Glu Gly Asn Val Ile Se - #r Gly Ser Ala Asn Ser     #               335     - Thr Phe Gly Ile Gln Glu Arg Ala Asp Gly Th - #r Asp Tyr Ser Ser Leu     #           350     - Tyr Ala Asn Thr Ile Asp Gly Val Gln Asn Gl - #y Thr Val Arg Leu Tyr     #       365     - Gly Ala Asn Ser Thr Val Ser Glu Gln Pro Se - #r Ser Gly Gln Gln Ala     #   380     - Thr     - 385     - (2) INFORMATION FOR SEQ ID NO:28:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 29 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:28:     - Val Phe Asn Ala Lys Asp Phe Gly Ala Leu Gl - #y Asp Gly Ala Ser Asp     #                15     - Asp Arg Pro Ala Ile Gln Ala Ala Ile Asp Al - #a Ala Tyr     #            25     - (2) INFORMATION FOR SEQ ID NO:29:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 426 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: DNA (genomic)     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:29:     - CTGGTCGGCG GCGACACGGA CGACCAGCTC CAGGGCGGCT CCGGCGCCGA TC - #GCCTGGAC       60     - GGCGGGGCCG GCGACGACAT CCTCGACGGC GGCGCCGGGC GCGACCGGCT GA - #GCGGCGGC      120     - GCGGGCGCCG ACACCTTCGT GTTCTCCGCC CGCGAGGACA GCTACCGTAC CG - #ACACGGCG      180     - GTGTTCAACG ACCTGATCCT CGACTTCGAG GCCAGCGAGG ATCGCATCGA CC - #TGTCCGCG      240     - CTGGGCTTTT CCGGCCTGGG CGACGGCTAT GGCGGCACCC TGCTCCTGAA GA - #CCAACGCC      300     - GAGGGCACGC GCACCTACCT GAAAAGCTTC GAGGCGGATG CCGAGGGACG GC - #GCTTCGAG      360     - GTCGCCCTGG ACGGCGACCA CACGGGCGAT CTTTCCGCCG CCAATGTGGT CT - #TCGCCGCG      420     #          426     - (2) INFORMATION FOR SEQ ID NO:30:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 453 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: DNA (genomic)     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:30:     - CTCGAAGGCA GCGCGGGCAA CGATGCGCTG AGCGGGACCG AGGCCCACGA GA - #CGCTGCTC       60     - GGCCAGGCCG GCGACGACCG CCTGAACGGC GATGCCGGCA ACGACATCCT CG - #ACGGCGGG      120     - GCAGGGCGCG ACAACCTGAC CGGCGGCGCG GGCGCCGACA CCTTCCGCTT CT - #CCGCGCGC      180     - ACCGACAGCT ACCGCACCGA CAGCGCCAGC TTCAACGACC TGATCACCGA CT - #TCGACGCC      240     - GACGAGGACA GCATCGACCT GTCCGCGCTG GGCTTCACCG GCCTGGGCGA CG - #GCTACAAT      300     - GGCACCCTGC TGCTGAAGAC CAACGCCGAG GGTACGCGCA CCTACCTGAA GA - #GCTACGAA      360     - GCGGACGCCC AGGGCCGGCG CTTCGAGATC GCCCTGGACG GCAACTTCAC CG - #GTCTGTTC      420     #        453       TGTT CGACGCCGCT CCG     - (2) INFORMATION FOR SEQ ID NO:31:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 459 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: DNA (genomic)     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:31:     - GCCACCGGTA CCGAGGGCAG CGACAACCTG CTCGGCACCG ACGCCGGGGA AA - #CCCTCCTG       60     - GGCTACGGCG GCAACGACAC CCTCAACGGC GGGGCCGGCG ACGACATCCT GG - #TCGGCGGC      120     - GCCGGGCGCG ACAGCCTGAC CGGCGGCGCC GGGGCGGACG TGTTCCGCTT CG - #ACGCGCTG      180     - TCCGACAGCC AGCGCAACTA CACCACCGGC GACAACCAGG CCGACCGCAT TC - #TCGACTTC      240     - GACCCGACCC TGGACAGGAT CGACGTGTCG GCGCTGGGCT TCACCGGGCT GG - #GCAACGGC      300     - CGCAACGGCA CCCTCGCCGT GGTGCTCAAC AGCGCCGGCG ACCGCACCGA TC - #TGAAGAGC      360     - TACGACACCG ACGCCAACGG CTACAGCTTC GAGCTTTCCC TCGCGGGCAA CT - #ACCAGGGG      420     #   459            AGTT CGTTTTCGCG ACGTCTCAG     - (2) INFORMATION FOR SEQ ID NO:32:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 450 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: DNA (genomic)     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:32:     - ATCGAAGGCA CCGACGGCAA CGATACCTTG CAGGGCACCG AGGCCAACGA GC - #GGCTCCTC       60     - GGCCTGGACG GCCGGGACAA CCTGAACGGC GGCGCCGGCG ACGACATCCT CG - #ACGGCGGA      120     - GCGGGGCGCG ACACCCTGAC CGGCGGCACG GGGGCCGACA CCTTCCTGTT CT - #CCACGCGT      180     - ACCGACAGCT ACCGCACCGA CAGCGCCAGC TTCAACGACC TGATCACCGA CT - #TCGATCCC      240     - ACCCAGGACC GCATCGACCT GTCCGGCCTG GGCTTCAGCG GTTTCGGCAA CG - #GCTACGAC      300     - GGCACCCTGC TGCTGCAGGT CAACGCCGCG GGCACCCGCA CCTACCTGAA GA - #GTTTCGAG      360     - GCCGATGCCA ACGGCCAGCG CTTCGAGATC GCCCTGGACG GCGACTTCAG CG - #GCCAATTG      420     #          450     TCTT CGAGCCCGCC     - (2) INFORMATION FOR SEQ ID NO:33:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 447 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: DNA (genomic)     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:33:     - GGGACCGACG GCAACGACGT GCTGGTCGGC AGCGATGCCA ACGACCAGCT CT - #ACGGCGGA       60     - GCCGGCGACG ACCGCCTGGA CGGCGGCGCC GGTGACGACC TGCTCGACGG CG - #GAGCGGGG      120     - CGCGACGACC TGACCGGCGG CACGGGTGCC GACACCTTCG TGTTCGCCGC GC - #GTACCGAT      180     - AGCTACCGCA CCGACGCGGG GGTGTTCAAC GACCTGATCC TCGACTTCGA CG - #CCAGCGAG      240     - GACCGCATCG ACCTGTCCGC CCTGGGTTTC AGCGGCTTCG GCGACGGCTA CA - #ACGGCACC      300     - CTGCTGGTGC AGCTCAGCAG CGCCGGAACC CGTACCTACC TCAAGAGCTA CG - #AGGAGGAC      360     - CTCGAGGGCC GGCGCTTCGA GGTCGCCCTG GACGGCGACC ACACGGGCGA TC - #TTTCCGCC      420     #            447   CCGA CGACGGC     - (2) INFORMATION FOR SEQ ID NO:34:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 453 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: DNA (genomic)     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:34:     - CTCGAAGGCA CGGCCGGCAA CGACACGCTT GGCGGCAGCG ACGCCCACGA GA - #CGCTGCTC       60     - GGGCTGGACG GCAACGACCG CCTGAACGGC GGCGCCGGCA ACGACATCCT CG - #ACGGCGGC      120     - GCCGGGCGCG ACAACCTGAC CGGCGGCGCG GGCGCCGACC TGTTCCGCGT CT - #CCGCGCGC      180     - ACCGACAGCT ACCGCACCGA CAGCGCCAGC TTCAACGACC TGATCACCGA CT - #TCGACGCC      240     - AGCCAGGACC GCATCGACCT GTCCGCGCTG GGCTTCACCG GGCTGGGCGA CG - #GCTATAAC      300     - GGCACCCTGC TGCTGCAGGT CAGCGCCGAC GGCAGCCGCA CCTATCTGAA GA - #GCCTGGAG      360     - GCGGATGCCG AGGGGCGGCG TTTCGAGATC GCCCTGGACG GCAACTTCGC CG - #GCCTGCTC      420     #        453       TCTT CGAGCGCACC GCC     - (2) INFORMATION FOR SEQ ID NO:35:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 459 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: DNA (genomic)     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:35:     - ATCGAGGGGG ATGCCGGCGA CAACGCCCTG CTCGGTACCT CGGCCGCCGA GA - #CATTGCTC       60     - GGCCACGCCG GCAACGACAC GCTCGACGGC GGGGCCGGCG ACGACATCCT GG - #TCGGCGGC      120     - GCCGGGCGCG ACAGCCTCAC CGGCGGCGCC GGAGCGGACG TGTTCCGCTT CG - #ACGCGCTG      180     - TCCGACAGCC AGCGCAACTA CGACATCGGC GACAACCAGG GCGACCGCAT CG - #CCGACTTC      240     - GCGGTGGGCG AAGACAAGCT CGACGTATCG GCGCTGGGCT TCACCGGGCT GG - #GCGACGGC      300     - TACAACGGCA CCCTCGCCCT GGTGCTCAAC AGCGCCGGCG ACCGCACCTA CG - #TGAAAAGC      360     - TACGAGAACG GCGCCGACGG CTACCGCTTC GAGTTTTCCC TCGACGGCAA CT - #ATCTGGAG      420     #   459            ATTT CATCTTCGCC ACGCCCAGC     - (2) INFORMATION FOR SEQ ID NO:36:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 396 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: DNA (genomic)     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:36:     - CTCGAAGGCA GCGCCGGCAA CGACAGCCTG CAGGGCACGG CCGCCGACGA GG - #TGATCCAC       60     - GGCGGCGGCG GGCGCGACAC GCTGGCCGGA GGGGCCGGGG CCGACGTGTT CC - #GCTTTAGC      120     - GAACTGACCG ACAGCTACCG AGACAGTGCC AGCTATGCCG ATCTGATCAC TG - #ACTTCGAT      180     - GCCAGCGAGG ATCGTATCGA CCTGTCCGGC CTCGGCTTCA GCGGTCTGGG CA - #ACGGCTAC      240     - GGCGGTACCC TGGCGCTGCA GGTGAACAGC GCCGGTACCC GCACCTACCT GA - #AGAGCTTC      300     - GAGACCAACG CCGCCGGCGA GCGTTTCGAG ATCGCCCTGG ACGGCGACCT GT - #CCGCGCTC      360     #      396         TCCT CGACGCGCGT ACCGTA     - (2) INFORMATION FOR SEQ ID NO:37:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 459 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: DNA (genomic)     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:37:     - CTGGCGGGCG GCGACGGCAA CGACACGCTT TCCGGCAGCA GCGCGGCCGA GG - #AACTGCTC       60     - GGCGGGGTCG GCAACGACAG CCTGGACGGC GGCGCCGGCA ACGACATCCT CG - #ACGGCGGG      120     - GCGGGGCGCG ACACCCTGAG TGGCGGCAGC GGCAGCGACA TCTTCCGCTT CG - #GCGGCGCG      180     - CTCGACAGCT TCCGCAACTA CGCCAGCGGG ACGAACGGCA CCGACAGCAT CA - #CCGACTTC      240     - ACCCCCGGCG AGGATCTGAT CGACCTCTCC GTGCTCGGCT ACACCGGGCT GG - #GCGACGGC      300     - TACAACGGTA CCCTGGCGAT AGTGCTGAAC GACGCCGGCA CCAAGACCTA CC - #TGAAAAAC      360     - CGCGAGAGCG ACGCCGAAGG CAACCAGTTC GAGATCGCCC TGGAGGGCAA CC - #ACGCCGAC      420     #   459            ACTT CATCTTCGCC ACGGCGGCC     - (2) INFORMATION FOR SEQ ID NO:38:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 453 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: DNA (genomic)     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:38:     - CTCGAAGGCA CCGCGGGCAA CGACGTGCTC AGCGGAACGG GTGCCCACGA GC - #TGATTCTC       60     - GGCCTGGCCG GCAACGATCG CCTGGACGGT GGCGCCGGCG ACGACACCCT CG - #ACGGCGGC      120     - GCGGGGCGCG ATACCCTGAC CGGCGGCGCG GGCGCCGATA CCTTCCGCTT CT - #CTGCCCGC      180     - GAGGACAGTC ACCGCACCGA CAGCGCCAGC TTCACCGACC TGATCACCGA CT - #TCGACGCC      240     - AGCCAGGACC GCATCGACCT CTCCGCGCTG GGCTTCACCG GTCTGGGCAA CG - #GTTATGAC      300     - GGCACCCTGG CGGTGACCAC CGGTTCCGGC GGCACCCGCA CCTACCTGAA GA - #GCTACGAG      360     - GTGGACGCCC AGGGCCGGCG TTTCGAAATC GCCCTGGACG GCAACTTCGT CG - #GCCAGTTC      420     #        453       TGTT CGACGCCGCT CCG     - (2) INFORMATION FOR SEQ ID NO:39:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 459 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: DNA (genomic)     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:39:     - GTCACCGGTA CCGAGGGCAA CGACAACCTG TCCGGCACCG ATGCCGGGGA AA - #CCCTCCTG       60     - GGCTACGGCG GCAACGACAC CCTCAACGGC GGGGCCGGCA ACGACATCCT GG - #TCGGCGGC      120     - GCCGGGCGCG ACACCCTGAC CGGCGGCGCC GGGGCGGACG TGTTCCGCTT CG - #AGGCGCTG      180     - TCCGACAGCC AGCGCAACTA CACCGCCGGC GACAACCAGG GCGATTACAT CA - #TCGACTTC      240     - GCCGTGGGCG AAGACAGGAT CGACGTATCG GCGCTGGGTT ACACCGGGCT GG - #GCAACGGC      300     - CGCAACGGCA CCCTCGCCGT GGTGCTCAAC AGCGCCGGCG ACCGCACCTA CG - #TGAAGAGC      360     - TACGACACTG ACGCCAACGG CTATAACTTC GAGCTTTCCC TCGCGGGCAA CT - #ACCAGGGG      420     #   459            AGTT CGTCTTCGCC ACGCCCCCG     - (2) INFORMATION FOR SEQ ID NO:40:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 450 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: DNA (genomic)     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:40:     - ATCGAGGGAA CCGACGGCAA CGACAGCTTG CAAGGGACCG GGGCCGACGA AC - #TGCTCCTC       60     - GGTCTGGGCG GCCGGGACAG CCTGAACGGC GGCGCCGGCG ACGATGTCCT GG - #ATGGCGGG      120     - GCGGAGCGCG ACACCCTGAC CGGCGGCACG GGGGCCGACA CCTTCCTGTT CT - #CCGCGCGT      180     - ACCGACAGCT ACCGCACCGA CAGCGCCAGC TTCACCGACC TGATCACCGA CT - #TCGATCCC      240     - GCCCAGGATC GCATCGACCT GTCCGGCCTG GGCTTCAGCG GTTTCGGCAA CG - #GCTACGAC      300     - GGCACCCTGC TGCTGCAGGT CAACGCCGCG GGCACCCGCA CCTACCTGAA GA - #GCCTCGAG      360     - GCCGATGCCG ACGGCCAGCG CTTCGAGATC GCCCTGGACG GCGACTTCAG CG - #GCCAGTTG      420     #          450     TCTT CGAGGCCGGC     - (2) INFORMATION FOR SEQ ID NO:41:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 142 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:41:     - Leu Val Gly Gly Asp Thr Asp Asp Gln Leu Gl - #n Gly Gly Ser Gly Ala     #                15     - Asp Arg Leu Asp Gly Gly Ala Gly Asp Asp Il - #e Leu Asp Gly Gly Ala     #            30     - Gly Arg Asp Arg Leu Ser Gly Gly Ala Gly Al - #a Asp Thr Phe Val Phe     #        45     - Ser Ala Arg Glu Asp Ser Tyr Arg Thr Asp Th - #r Ala Val Phe Asn Asp     #    60     - Leu Ile Leu Asp Phe Glu Ala Ser Glu Asp Ar - #g Ile Asp Leu Ser Ala     #80     - Leu Gly Phe Ser Gly Leu Gly Asp Gly Tyr Gl - #y Gly Thr Leu Leu Leu     #                95     - Lys Thr Asn Ala Glu Gly Thr Arg Thr Tyr Le - #u Lys Ser Phe Glu Ala     #           110     - Asp Ala Glu Gly Arg Arg Phe Glu Val Ala Le - #u Asp Gly Asp His Thr     #       125     - Gly Asp Leu Ser Ala Ala Asn Val Val Phe Al - #a Ala Thr Gly     #   140     - (2) INFORMATION FOR SEQ ID NO:42:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 151 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:42:     - Leu Glu Gly Ser Ala Gly Asn Asp Ala Leu Se - #r Gly Thr Glu Ala His     #                15     - Glu Thr Leu Leu Gly Gln Ala Gly Asp Asp Ar - #g Leu Asn Gly Asp Ala     #            30     - Gly Asn Asp Ile Leu Asp Gly Gly Ala Gly Ar - #g Asp Asn Leu Thr Gly     #        45     - Gly Ala Gly Ala Asp Thr Phe Arg Phe Ser Al - #a Arg Thr Asp Ser Tyr     #    60     - Arg Thr Asp Ser Ala Ser Phe Asn Asp Leu Il - #e Thr Asp Phe Asp Ala     #80     - Asp Glu Asp Ser Ile Asp Leu Ser Ala Leu Gl - #y Phe Thr Gly Leu Gly     #                95     - Asp Gly Tyr Asn Gly Thr Leu Leu Leu Lys Th - #r Asn Ala Glu Gly Thr     #           110     - Arg Thr Tyr Leu Lys Ser Tyr Glu Ala Asp Al - #a Gln Gly Arg Arg Phe     #       125     - Glu Ile Ala Leu Asp Gly Asn Phe Thr Gly Le - #u Phe Asn Asp Asn Asn     #   140     - Leu Leu Phe Asp Ala Ala Pro     145                 1 - #50     - (2) INFORMATION FOR SEQ ID NO:43:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 153 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:43:     - Ala Thr Gly Thr Glu Gly Ser Asp Asn Leu Le - #u Gly Thr Asp Ala Gly     #                15     - Glu Thr Leu Leu Gly Tyr Gly Gly Asn Asp Th - #r Leu Asn Gly Gly Ala     #            30     - Gly Asp Asp Ile Leu Val Gly Gly Ala Gly Ar - #g Asp Ser Leu Thr Gly     #        45     - Gly Ala Gly Ala Asp Val Phe Arg Phe Asp Al - #a Leu Ser Asp Ser Gln     #    60     - Arg Asn Tyr Thr Thr Gly Asp Asn Gln Ala As - #p Arg Ile Leu Asp Phe     #80     - Asp Pro Thr Leu Asp Arg Ile Asp Val Ser Al - #a Leu Gly Phe Thr Gly     #                95     - Leu Gly Asn Gly Arg Asn Gly Thr Leu Ala Va - #l Val Leu Asn Ser Ala     #           110     - Gly Asp Arg Thr Asp Leu Lys Ser Tyr Asp Th - #r Asp Ala Asn Gly Tyr     #       125     - Ser Phe Glu Leu Ser Leu Ala Gly Asn Tyr Gl - #n Gly Gln Leu Ser Ala     #   140     - Glu Gln Phe Val Phe Ala Thr Ser Gln     145                 1 - #50     - (2) INFORMATION FOR SEQ ID NO:44:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 150 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:44:     - Ile Glu Gly Thr Asp Gly Asn Asp Thr Leu Gl - #n Gly Thr Glu Ala Asn     #                15     - Glu Arg Leu Leu Gly Leu Asp Gly Arg Asp As - #n Leu Asn Gly Gly Ala     #            30     - Gly Asp Asp Ile Leu Asp Gly Gly Ala Gly Ar - #g Asp Thr Leu Thr Gly     #        45     - Gly Thr Gly Ala Asp Thr Phe Leu Phe Ser Th - #r Arg Thr Asp Ser Tyr     #    60     - Arg Thr Asp Ser Ala Ser Phe Asn Asp Leu Il - #e Thr Asp Phe Asp Pro     #80     - Thr Gln Asp Arg Ile Asp Leu Ser Gly Leu Gl - #y Phe Ser Gly Phe Gly     #                95     - Asn Gly Tyr Asp Gly Thr Leu Leu Leu Gln Va - #l Asn Ala Ala Gly Thr     #           110     - Arg Thr Tyr Leu Lys Ser Phe Glu Ala Asp Al - #a Asn Gly Gln Arg Phe     #       125     - Glu Ile Ala Leu Asp Gly Asp Phe Ser Gly Gl - #n Leu Asp Ser Gly Asn     #   140     - Val Ile Phe Glu Pro Ala     145                 1 - #50     - (2) INFORMATION FOR SEQ ID NO:45:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 149 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:45:     - Gly Thr Asp Gly Asn Asp Val Leu Val Gly Se - #r Asp Ala Asn Asp Gln     #                15     - Leu Tyr Gly Gly Ala Gly Asp Asp Arg Leu As - #p Gly Gly Ala Gly Asp     #            30     - Asp Leu Leu Asp Gly Gly Ala Gly Arg Asp As - #p Leu Thr Gly Gly Thr     #        45     - Gly Ala Asp Thr Phe Val Phe Ala Ala Arg Th - #r Asp Ser Tyr Arg Thr     #    60     - Asp Ala Gly Val Phe Asn Asp Leu Ile Leu As - #p Phe Asp Ala Ser Glu     #80     - Asp Arg Ile Asp Leu Ser Ala Leu Gly Phe Se - #r Gly Phe Gly Asp Gly     #                95     - Tyr Asn Gly Thr Leu Leu Val Gln Leu Ser Se - #r Ala Gly Thr Arg Thr     #           110     - Tyr Leu Lys Ser Tyr Glu Glu Asp Leu Glu Gl - #y Arg Arg Phe Glu Val     #       125     - Ala Leu Asp Gly Asp His Thr Gly Asp Leu Se - #r Ala Ala Asn Val Val     #   140     - Phe Ala Asp Asp Gly     145     - (2) INFORMATION FOR SEQ ID NO:46:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 151 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:46:     - Leu Glu Gly Thr Ala Gly Asn Asp Thr Leu Gl - #y Gly Ser Asp Ala His     #                15     - Glu Thr Leu Leu Gly Leu Asp Gly Asn Asp Ar - #g Leu Asn Gly Gly Ala     #            30     - Gly Asn Asp Ile Leu Asp Gly Gly Ala Gly Ar - #g Asp Asn Leu Thr Gly     #        45     - Gly Ala Gly Ala Asp Leu Phe Arg Val Ser Al - #a Arg Thr Asp Ser Tyr     #    60     - Arg Thr Asp Ser Ala Ser Phe Asn Asp Leu Il - #e Thr Asp Phe Asp Ala     #80     - Ser Gln Asp Arg Ile Asp Leu Ser Ala Leu Gl - #y Phe Thr Gly Leu Gly     #                95     - Asp Gly Tyr Asn Gly Thr Leu Leu Leu Gln Va - #l Ser Ala Asp Gly Ser     #           110     - Arg Thr Tyr Leu Lys Ser Leu Glu Ala Asp Al - #a Glu Gly Arg Arg Phe     #       125     - Glu Ile Ala Leu Asp Gly Asn Phe Ala Gly Le - #u Leu Gly Ala Gly Asn     #   140     - Leu Leu Phe Glu Arg Thr Ala     145                 1 - #50     - (2) INFORMATION FOR SEQ ID NO:47:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 153 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:47:     - Ile Glu Gly Asp Ala Gly Asp Asn Ala Leu Le - #u Gly Thr Ser Ala Ala     #                15     - Glu Thr Leu Leu Gly His Ala Gly Asn Asp Th - #r Leu Asp Gly Gly Ala     #            30     - Gly Asp Asp Ile Leu Val Gly Gly Ala Gly Ar - #g Asp Ser Leu Thr Gly     #        45     - Gly Ala Gly Ala Asp Val Phe Arg Phe Asp Al - #a Leu Ser Asp Ser Gln     #    60     - Arg Asn Tyr Asp Ile Gly Asp Asn Gln Gly As - #p Arg Ile Ala Asp Phe     #80     - Ala Val Gly Glu Asp Lys Leu Asp Val Ser Al - #a Leu Gly Phe Thr Gly     #                95     - Leu Gly Asp Gly Tyr Asn Gly Thr Leu Ala Le - #u Val Leu Asn Ser Ala     #           110     - Gly Asp Arg Thr Tyr Val Lys Ser Tyr Glu As - #n Gly Ala Asp Gly Tyr     #       125     - Arg Phe Glu Phe Ser Leu Asp Gly Asn Tyr Le - #u Glu Leu Leu Gly Asn     #   140     - Glu Asp Phe Ile Phe Ala Thr Pro Ser     145                 1 - #50     - (2) INFORMATION FOR SEQ ID NO:48:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 132 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:48:     - Leu Glu Gly Ser Ala Gly Asn Asp Ser Leu Gl - #n Gly Thr Ala Ala Asp     #                15     - Glu Val Ile His Gly Gly Gly Gly Arg Asp Th - #r Leu Ala Gly Gly Ala     #            30     - Gly Ala Asp Val Phe Arg Phe Ser Glu Leu Th - #r Asp Ser Tyr Arg Asp     #        45     - Ser Ala Ser Tyr Ala Asp Leu Ile Thr Asp Ph - #e Asp Ala Ser Glu Asp     #    60     - Arg Ile Asp Leu Ser Gly Leu Gly Phe Ser Gl - #y Leu Gly Asn Gly Tyr     #80     - Gly Gly Thr Leu Ala Leu Gln Val Asn Ser Al - #a Gly Thr Arg Thr Tyr     #                95     - Leu Lys Ser Phe Glu Thr Asn Ala Ala Gly Gl - #u Arg Phe Glu Ile Ala     #           110     - Leu Asp Gly Asp Leu Ser Ala Leu Gly Gly Al - #a Asn Leu Ile Leu Asp     #       125     - Ala Arg Thr Val         130     - (2) INFORMATION FOR SEQ ID NO:49:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 153 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:49:     - Leu Ala Gly Gly Asp Gly Asn Asp Thr Leu Se - #r Gly Ser Ser Ala Ala     #                15     - Glu Glu Leu Leu Gly Gly Val Gly Asn Asp Se - #r Leu Asp Gly Gly Ala     #            30     - Gly Asn Asp Ile Leu Asp Gly Gly Ala Gly Ar - #g Asp Thr Leu Ser Gly     #        45     - Gly Ser Gly Ser Asp Ile Phe Arg Phe Gly Gl - #y Ala Leu Asp Ser Phe     #    60     - Arg Asn Tyr Ala Ser Gly Thr Asn Gly Thr As - #p Ser Ile Thr Asp Phe     #80     - Thr Pro Gly Glu Asp Leu Ile Asp Leu Ser Va - #l Leu Gly Tyr Thr Gly     #                95     - Leu Gly Asp Gly Tyr Asn Gly Thr Leu Ala Il - #e Val Leu Asn Asp Ala     #           110     - Gly Thr Lys Thr Tyr Leu Lys Asn Arg Glu Se - #r Asp Ala Glu Gly Asn     #       125     - Gln Phe Glu Ile Ala Leu Glu Gly Asn His Al - #a Asp Gln Leu Asp Ala     #   140     - Ser Asp Phe Ile Phe Ala Thr Ala Ala     145                 1 - #50     - (2) INFORMATION FOR SEQ ID NO:50:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 151 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:50:     - Leu Glu Gly Thr Ala Gly Asn Asp Val Leu Se - #r Gly Thr Gly Ala His     #                15     - Glu Leu Ile Leu Gly Leu Ala Gly Asn Asp Ar - #g Leu Asp Gly Gly Ala     #            30     - Gly Asp Asp Thr Leu Asp Gly Gly Ala Gly Ar - #g Asp Thr Leu Thr Gly     #        45     - Gly Ala Gly Ala Asp Thr Phe Arg Phe Ser Al - #a Arg Glu Asp Ser His     #    60     - Arg Thr Asp Ser Ala Ser Phe Thr Asp Leu Il - #e Thr Asp Phe Asp Ala     #80     - Ser Gln Asp Arg Ile Asp Leu Ser Ala Leu Gl - #y Phe Thr Gly Leu Gly     #                95     - Asn Gly Tyr Asp Gly Thr Leu Ala Val Thr Th - #r Gly Ser Gly Gly Thr     #           110     - Arg Thr Tyr Leu Lys Ser Tyr Glu Val Asp Al - #a Gln Gly Arg Arg Phe     #       125     - Glu Ile Ala Leu Asp Gly Asn Phe Val Gly Gl - #n Phe Asn Asp Gly Asn     #   140     - Leu Leu Phe Asp Ala Ala Pro     145                 1 - #50     - (2) INFORMATION FOR SEQ ID NO:51:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 153 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:51:     - Val Thr Gly Thr Glu Gly Asn Asp Asn Leu Se - #r Gly Thr Asp Ala Gly     #                15     - Glu Thr Leu Leu Gly Tyr Gly Gly Asn Asp Th - #r Leu Asn Gly Gly Ala     #            30     - Gly Asn Asp Ile Leu Val Gly Gly Ala Gly Ar - #g Asp Thr Leu Thr Gly     #        45     - Gly Ala Gly Ala Asp Val Phe Arg Phe Glu Al - #a Leu Ser Asp Ser Gln     #    60     - Arg Asn Tyr Thr Ala Gly Asp Asn Gln Gly As - #p Tyr Ile Ile Asp Phe     #80     - Ala Val Gly Glu Asp Arg Ile Asp Val Ser Al - #a Leu Gly Tyr Thr Gly     #                95     - Leu Gly Asn Gly Arg Asn Gly Thr Leu Ala Va - #l Val Leu Asn Ser Ala     #           110     - Gly Asp Arg Thr Tyr Val Lys Ser Tyr Asp Th - #r Asp Ala Asn Gly Tyr     #       125     - Asn Phe Glu Leu Ser Leu Ala Gly Asn Tyr Gl - #n Gly Leu Leu Gly Ala     #   140     - Glu Gln Phe Val Phe Ala Thr Pro Pro     145                 1 - #50     - (2) INFORMATION FOR SEQ ID NO:52:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 150 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:52:     - Ile Glu Gly Thr Asp Gly Asn Asp Ser Leu Gl - #n Gly Thr Gly Ala Asp     #                15     - Glu Leu Leu Leu Gly Leu Gly Gly Arg Asp Se - #r Leu Asn Gly Gly Ala     #            30     - Gly Asp Asp Val Leu Asp Gly Gly Ala Glu Ar - #g Asp Thr Leu Thr Gly     #        45     - Gly Thr Gly Ala Asp Thr Phe Leu Phe Ser Al - #a Arg Thr Asp Ser Tyr     #    60     - Arg Thr Asp Ser Ala Ser Phe Thr Asp Leu Il - #e Thr Asp Phe Asp Pro     #80     - Ala Gln Asp Arg Ile Asp Leu Ser Gly Leu Gl - #y Phe Ser Gly Phe Gly     #                95     - Asn Gly Tyr Asp Gly Thr Leu Leu Leu Gln Va - #l Asn Ala Ala Gly Thr     #           110     - Arg Thr Tyr Leu Lys Ser Leu Glu Ala Asp Al - #a Asp Gly Gln Arg Phe     #       125     - Glu Ile Ala Leu Asp Gly Asp Phe Ser Gly Gl - #n Leu Asp Ser Gly Asn     #   140     - Val Ile Phe Glu Ala Gly     145                 1 - #50     __________________________________________________________________________ 

We claim:
 1. A DNA compound comprising a sequence encoding mannuronan C- 5-epimerase, said sequence comprising one or more sequences selected from the group consisting of a DNA A block comprising the DNA of SEQ ID NO:7 or any one of SEQ ID NOS:17-22, or a DNA molecule encoding an amino acid sequences comprising an amino acid sequence of SEQ ID NO:8 or any one of SEQ ID NOS:23-28, or a DNA molecule which hybridizes to consensus sequence SEQ ID NO:7 under high stringency conditions comprising a wash step of 3.2 M tetramethylammonium chloride at 50° C.; and a DNA R block comprising the DNA of SEQ ID NO:9 or any one of SEQ ID NOS:29-40, or a DNA molecule encoding an amino sequence comprising an amino acid sequence of SEQ ID NO:10 or any one of SEQ ID NOS:41-52, or a DNA molecule which hybridizes to consensus sequence SEQ ID NO:9 under high stringency conditions comprising a wash step of 3.2 M tetramethylammonium chloride at 50° C.; and being isolated from a natural source or being derived synthetically.
 2. An isolated DNA compound according to claim 1 encoding mannuronan C-5-epimerase that comprises at least one DNA block A comprising the DNA of SEQ ID NO:7 or any one of SEQ ID NOS:17-22, and a DNA block R comprising the DNA of SEQ ID NO:9 or any one of SEQ ID NOS:29-40 in a number of 0 to
 5. 3. An isolated DNA compound comprising at least a portion of the DNA sequence as set forth in SEQ ID NO:1 wherein said compound comprises one or more sequences encoding a DNA A block comprising the DNA of SEQ ID NO:7 or any one of SEQ ID NOS:17-22, or a DNA molecule encoding an amino acid sequence comprising an amino acid sequence of SEQ ID NO:8 or any one of SEQ ID NOS:23-28, or a DNA molecule which hybridizes to consensus sequence SEQ ID NO:7 under high stringency conditions comprising a wash step of 3.2 M tetramethylammonium chloride at 50° C.; and/or a DNA R block comprising the DNA of SEQ ID NO:9 or any one of SEQ ID NOS:29-40, or a DNA molecule encoding an amino sequence comprising an amino acid sequence of SEQ ID NO:10 or any one of SEQ ID NOS:41-52, or a DNA molecule which hybridizes to consensus sequence SEQ ID NO:9 under high stringency conditions comprising a wash step of 3.2 M tetramethylammonium chloride at 50° C.
 4. An isolated DNA compound according to claim 1 that encodes at least a portion of the protein amino acid sequence as set, said sequence containing SEQ ID NOS:2,3,4 and
 5. 5. A recombinant DNA vector that comprises the DNA sequence of claim
 1. 6. A method for constructing a recombinant host cell for production of enzymes having mannuronan C-5-epimerase activity and/or the microbiological production of alginates which comprises transforming a host cell with the DNA of claim
 1. 7. The method according to claim 6 which comprises the microbiological production of alginates having a high G block content of 75-98%.
 8. The method according to claim 6 which comprises the microbiological production of alginates wherein the DNA sequence is selected to produce an alginate having a desired M/G block content.
 9. A method for the production of enzymes having mannuronan C-5-epimerase activity comprising constructing a recombinant host cell capable of expressing epimerase activity, characterized by transforming said host cell with a recombinant DNA expression vector that comprises:(a) a promoter and translational activating sequence that function in said host cell; and (b) a DNA sequence encoding mannuronan C-5-epimerase comprising a DNA block A comprising the DNA of SEQ ID NO:7 or any one of SEQ ID NOS:17-22, or a DNA molecule encoding an amino acid sequence comprising an amino acid sequence of SEQ ID NO:8 or any one of SEQ ID NOS:23-28, or a DNA molecule which hybridizes to consensus sequence SEQ ID NO:7 under high stringency conditions comprising a wash step of 3.2 M tetramethylammonium chloride at 50° C. and/or a DNA block R comprising the DNA of SEQ ID NO:9 or any one of SEQ ID NOS:29-40, or a DNA molecule encoding an amino sequence comprising an amino acid sequence of SEQ ID NO:10 or any one of SEQ ID NOS:41-52, or a DNA molecule which hybridizes to consensus sequence SEQ ID NO:9 under high stringency conditions comprising a wash step of 3.2 M tetramethylammonium chloride at 50° C., positioned for expression from said promoter and translational activity sequence.
 10. A process for the bacterial production of alginates comprising selectively inactivating one or more DNA sequences containing at least one A block comprising the DNA of SEQ ID NO:7 or any one of SEQ ID NOS:17-22, or a DNA molecule encoding an amino acid sequence comprising an amino acid sequence of SEQ ID NO:8 or any one of SEQ ID NOS:23-28, or a DNA molecule which hybridizes to consensus sequence SEQ ID NO:7 under high stringency conditions comprising a wash step of 3.2 M tetramethylammonium chloride at 50° C., or R block comprising the DNA of SEQ ID NO:9 or any one of SEQ ID NOS:29-40, a a DNA molecule encoding an amino sequence comprising an amino acid sequence of SEQ ID NO:10 or any one of SEQ ID NOS:41-52, or a DNA molecule which hybridizes to consensus sequence SEQ ID NO:9 under high stringency conditions comprising a wash step of 3.2 M tetramethylammonium chloride at 50° C. and encoding mannuronan C-5-epimerases in a natural host cell wherein said mannuronan C-5-epimerases produce alginates.
 11. Process according to claim 10 for the production of pure poly-M alginate or alginates having a G block content from 0 to 25%.
 12. Process according to claim 10 for the production of alginates wherein said DNA sequences are selectively inactivated to produce alginates having a desired M/G block content and G block distribution.
 13. A DNA compound according to claim 1 wherein said compound is not naturally occurring in an alginate producing host and comprises at least one A block comprising the DNA of SEQ ID NO:7 or any one of SEQ ID NOS:17-22 and at least one R block comprising the DNA of SEQ ID NO:9 or any one of SEQ ID NOS:29-40 isolated from a natural source or being derived synthetically.
 14. A DNA compound encoding a mannuronan C-5-epimerase, wherein said C-5-epimerase comprises an amino acid sequence corresponding to that of a mannuronan C-5-epimerase produced by Azotobacter vinelandii.
 15. A DNA compound comprising a sequence encoding mannuronan C-5-epimerase, said sequence comprising one or more sequences of a DNA A block comprising the DNA of SEQ ID NO:7 or any one of SEQ ID NOS:17-22, or a DNA molecule encoding an amino acid sequence comprising an amino acid sequence of SEQ ID NO:8 or any one of SEQ ID NOS:23-28, or a DNA molecule which hybridizes to consensus sequence SEQ ID NO:7 under high stringency conditions comprising a wash step of 3.2 M tetramethylammonium chloride at 50° C. and being isolated from a natural source or being derived synthetically.
 16. The DNA compound of claim 15 further comprising one or more sequences of a DNA R block comprising the DNA of SEQ ID NO:9 or any one of SEQ ID NOS:29-40, or a DNA molecule encoding an amino sequence comprising an amino acid sequence of SEQ ID NO:10 or any one of SEQ ID NOS:41-52, or a DNA molecule which hybridizes to consensus sequence SEQ ID NO:9 under high stringency conditions comprising a wash step of 3.2 M tetramethylammonium chloride at 50° C. 